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Introduction 


In the present paper we continue our study of a class of (associative) algebras 
called Frobeniusean.' Chapter I, which is short, is a direct sequel of Part I and 
deals chiefly with a certain type of automorphisms in a Frobeniusean algebra. 
The automorphisms enable us to generalize and refine some of the theorems in 
Part I, and also clarify the significance of symmetric algebras. Further, an 
application of our results to the theory of Galois moduli and normal bases over 
modular fields is discussed. In Chapter II we make an attempt to extend our 
theory from algebras to general rings satisfying chain conditions. To do so we 
adopt the properties of I, 2, Lemma 2 as a definition of Frobeniusean and quasi- 
Frobeniusean rings. Then many of the theorems in I can be extended to this 
general case. But since this definition does not have much significance for 
itself, contrary to the case of algebras, Chapter II may be considered as a study 
of the structure of those rings in which the annihilation gives a 1-1 correspond- 
ence between left and right ideals.” Sections 6, 7 aim, however, not only at 
Frobeniusean rings but at Frobeniusean algebras. 


CuapTer I: A FurtTHER Stupy or FROBENIUSEAN ALGEBRAS 


1. A class of automorphisms in a Frobeniusean algebra 


Let A be a Frobeniusean algebra over a field F. Let a, a2, ---,@, bea 
basis of A with a multiplication table 
(1) a,Aag = > Apor A, « 


There exists a vector (A,) such that the corresponding parastrophic matrix 
P = (>>, ayerAr) is non-singular. We consider again the linear form A(x) = 
> re (2 = >> £,a,¢A) and the non-singular hyperplane H defined by 
A(z) = 0. Fora set S of elements in A we denote by pu(S)[qu(S)], as before, 
the set of all z satisfying zS GC H[Sx € H]. Any other non-singular hyper- 
plane H’ in A can be given as H’ = px(c) = He with a regular element c, 
and conversely with any regular element c Hc is a non-singular hyperplane in A 
(Cf. I, 6). 





1 On Frobeniusean algebras. I, Ann. Math. 40 (1939) referred to as Part I, or simply as I. 
? M. Hall (7) calls them (weakly) closed rings. 
1 
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THEOREM 1. There exists a uniquely determined automorphism ¢: x — x* of A 
such that 


(2) Aa*y) = A(yzr) (that is, x*y — yx « H) 


for allye¢A. H remains invariant under g; H* = H. 

y's corresponding to different choices of non-singular hyperplanes in A form a 
co-set with respect to the invariant subgroup consisting of all inner automorphisms 
in the automorphism group of A. 

Proor. Ifx = >> £,a, we put (§) = P(P’)“(g,) and «* = )>éa,. Then’ 
A(a*y) = (&*)P(n) = (€)P’(n) = (n)P(E) = Myx) where y = 3) 7,a,. This 
shows the existence of a mapping ¢: x — 2* such that (2) is true for all 2, y € A. 
¢ is uniquely determined by this property as we can see easily by reversing the 
above computation and observing particularly that P is non-singular. The 
mapping is evidently 1-1, and that (ax + 8z)* = axz* + Bz* is obvious. More- 
over, the relation \(x*z*y) = A(z*yx) = A(yxz) shows that (xz)* = x*z*. Hence 
gisan automorphism. Now, x e H implies 2* ¢ H, since z* — x = 2*1 — Ire H. 
Thus H* = H. 

To prove the second part of the theorem, consider a second non-singular 
hyperplane H’ = Hc (c regular). Since (cxe')*y — yx = ((cxce)*yo* — 
yc (cae '))e ¢e Hc = H' for all z, y ¢ A, the automorphism ¢’ corresponding to H’ 
is given by x — (cre ’)*. 

Lemma 1. The automorphism ¢ maps qu(S), with any set S in A, onto px(S); 
du(S)* = pu(S). In particular r()*(= r((*)) = pr(l) for a left ideal { in A.* 

Proor. Immediate from the definition and I, Lemma 4. 

Thus the automorphism ¢ correlates the two kinds of dual correspondences between 
left and right ideals in A; one is given by annihilation and the other is repre- 
sentation-theoretical. Namely 

TuroreM 2.’ If I, lo are left ideals in A and | => |, the representation of A 
defined by the left module {/l) 1s equivalent to the one defined by the right module 
r(lo)*/r()*. Furthermore, the representation defined by a left principal ideal Ac 
is equivalent to the one defined by the right ideal c*A. 

Proor. The theorem follows from the above lemma and I, 8, Lemma 6. 
For the second half cf. also the proof of I, 5, Th. 4. 

THEOREM 3. The automorphism algebra of the left module {/\[Ac] is inversely 
isomorphic to that of the right module r(lo)/r(){eA].° 

Proor. The automorphism algebra of a left [right] module is inversely 
[directly] isomorphic to the algebra of matrices which commute with all the 
matrices in a representation belonging to the module. It follows thus from 
Theorem 2 that the automorphism algebra of [/f is inversely isomorphic to 





§’ Consult the form of the parastrophic matrix P. Cf. also I, 3. 

4r(S) denotes, as before, the set of right annihilators of S in A. 

5 Cf. Th. 13 in the section 6 below. 

6 The last statement of I, Th. 5 is a very special case of the present theorem. 
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that of r(b)*/r()*. But this latter is evidently isomorphic to the automorphism 
algebra of r(fo)/r(l). 

Now, let Ey, és (« = 1, 2,---,k; ¢ = 1, 2,--- ,f(k)) and w(x) (k = 1, 
2, --- , k) have the same significance as in I. In particular Uy ~ Vey. Then 

TuroreM 4. Ey) = E,. With a suitable choice of H we have furthermore 
Cre) i = €,i (for all Ky i)." 

Proor. Obviously Ex = KE), for a certain ». But the representation 
defined by E,A must be equivalent to the one defined by AE,,). Hence 
} = x. To prove the second half we observe that we can choose H so that 
€x,:Aex,; & A if (w(x), 7) ¥ (A, J); this can be seen in exactly the same way as 
in Nakayama-Nesbitt (11). With such a choice of H we have indeed e;).; = 
€xi- For, (ex) = Adon. 5 xin, 5) = (Ex, :LCxe(n),i) = NOS ee) = 
A(LEx(x) i) 

Another immediate consequence of Lemma 1 is that if 3 is a two-sided ideal 
in A then 


U3) = r3)* = r(3*). 


In particular if ; = 3*, or if more particularly 3 is invariant under all auto- 
morphisms of A, then 1(3) = r(j3). (This proves again I, Th. 6 in case where 
the algebra is not only quasi-Frobeniusean but Frobeniusean.) We find more- 
over that we can put d = c* in I, Th. 9, Corollary. 

We note further that if S(z) and R(x) denote, as before, the left and the right 
regular representations of A with respect to the basis a , dz, --- , a, then we 
have the relation 


R(xz)P’ = P’S(a*) 


as a counterpart of the fundamental relation R(xz)P = PS(a). For, (a; , 
.++,@n) = (a, --- ,@,)(P’)P, as can be seen from the proof of Theorem 1, 
and if we put R(x) = R(v *(z)) then x > R(z) is the right regular representation 
with respect to the basis (a;), whence R(x) = P’P'R(x)(P’P™)" 
P’S(z)(P’)”. But this is nothing but the above relation. 

For an interesting application of the automorphism ¢ to a generalization of 
the orthogonality relation among representation coefficients, see a forthcoming 
paper by C. Nesbitt. 

Remark. Our algebra A is symmetric if and only if the automorphism ¢ is 
inner. The above treatment of Frobeniusean algebras can be considered as an 
extension of the section 9 in I. 

ExamMpPie. Cartan’s algebra of outer multiplication’ is Frobeniusean. Namely, 
let m be any natural number and consider an algebra A over a field F which is 
generated by F and m elements ¢ , C2, --+ , Cm and in which the law of compo- 





’ The automorphism ¢~ effects the permutation 7, so to speak. 
8See for instance Cartan (19). This example was suggested to the writer by 
C. Chevalley. 


ee 














4 TADASI NAKAYAMA 


sition is given by c; = 0, cc; = —c,c; fori # 7. This algebra A, of Cartan’s 
outer multiplication, has a basis 1, ¢i,¢i, --- ci, (4 < t2 <--- < 4%), and is 
Frobeniusean. For, if H is the hyperplane in A consisting of those elements 
which have a vanishing coefficient for the term ¢iC2 --- Cm when expressed by 
the above basis, then evidently H does not contain any ideal except the zero 
ideal. The automorphism ¢ belonging to the same H is defined simply by 
Gog = (—1)”""e; . If the characteristic of F is different from 2, then A is 
symmetric or not according as m is odd or even.” 


2. Galois moduli over modular fields 


Let K be a (finite and separable) Galois extension over F,, and let @ be the 
Galois group of K/F. Looking upon the elements of & as the right operators 
on K, we can consider K/F as a right representation module of the group algebra 
@(F) of @ over F. The well-known theorem of normal bases” states that this 
module is (operator-) isomorphic with @(F) itself, that is, K/F defines a repre- 
sentation of G equivalent to the regular representation. Furthermore, the 
image of a left ideal of G(/) by such an (operator-) isomorphism between G(F) 
and K/F is independent of the special choice of isomorphism, and such an image 
of a left ideal of G(F) in K is called a Galois module of K/F." 

Let K, be a field between F and K; K 2 K, 2 F, and let § be the subgroup 
of G belonging to K,. The group algebra $(K,) of § over Ki is then (operator-) 
isomorphic to K/K, with respect to the right operator algebra (Ki), and 
Galois moduli of K/K, are defined in the same way as above. Now, the fol- 
lowing interesting theorem was proved in Deuring (21) under the assumption 
that the group algebra G(F) is semisimple or, what is equivalent, that the 
degree (K:F) is not divisible by the characteristic of F: 

THEoREM 5. Let m be a Galois module of K/F such that it is a Ky-module. 
Then m is also a Galois module of K/K;, and moreover, the representation of © 
obtained from the left ideal of G(F) corresponding to the K/F-Galois-module m is 
equivalent to the one induced from the representation of the subgroup $ obtained 
from the left ideal of S(K,) corresponding to the K/K,-Galois-module m. 

The purpose of the present section is to note that this theorem holds without 
assuming the semisimplicity of G(F). 

Proor. Let [be the left ideal of G(F’) which corresponds to our K/F-Galois- 





° A can be considered also as the algebra of chains contained in an (absolute) simplex 
spanned by m vertices ¢1 , C2, -++ , Cm. 

10 Noether (26), Deuring (20), (21), Brauer (18). Artin has given another simple and 
elegant proof. 

1! Deuring (21). It seems to the writer that there is a slight confusion in the usage of 
the term “Galois module.’’ Noether defined first a Galois module as an image of a right 
ideal. Deuring used essentially the same definition in his paper (20), but switched in his 
second paper (21), as well as in his book (5), to the definition which we are adopting in 
the present paper. 
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module m. {| consists of all left annihilators of r = r(l) (I, Th. 1. Cf. also I, 9), 
and thus Deuring’s method can be transferred term by term to our general 
case. Namely, m consists of all elements a in K satisfying 


(3) ds a°Bs = 0 forall S-s SBs er. 


Due to our assumption that m is a K,-module, we have 


Ds h*a*Bs on Yr hn” > Pp Ps) a’"Bpr = 0 


for any h in K,, where © = })7 HT is the right co-set decomposition of © 
with respect to §. Applying this relation to a system of basis elements /, , 
he, ---,hm of Ki/F and noticing that the discriminant | hj |’ # 0, we find 
Dr a’ Ber = 0, or 


(4) as a’ Bpr = 0 (d's SBs € r) 


for all T (mod ). Since conversely (4) implies (3), the elements a of m can be 
characterized also by (4). Hence the image of m by an isomorphism between 
K/K, and $(K;) consists of all left annihilators of the elements }>» PBrr 
(>>s Ss ¢r), and is a left ideal in §(K,). Hence m is a Galois module of 
K/K,. The second half of the theorem can be proved also in the same way 
as in the original paper of Deuring (p. 46).” 


3. Appendix: On normal bases 


Deuring’s second proof of the theorem of normal bases, which was published 
in the same paper Deuring (21) also under the assumption that the characteristic 
does not divide the degree of extension, can be so modified that it works gen- 
erally. Furthermore, R. Stauffer’s’’ method for constructing a normal basis 
works, after a modification, also without that same assumption. These will 
be seen in the following. 

Let G be, as above, the Galois group of a Galois extension K/F. On taking a 
sufficiently large over-field F’ of F such that all absolutely irreducible repre- 
sentations of © lie in F’, we consider the algebra K’ = K X F’ over F’ instead 
of K itself. It is a right representation module of @ and we denote by M(S) 
(S «@) the corresponding representation of @. Let G(S) be an irreducible 
representation of @ (in F’) with a degree, say g, and let U(S) be the corre- 
sponding directly indecomposable part of the regular representation of @G; 
U(S) has G(S) as its first and its last irreducible as well as largest completely 
reducible components. We wish to show that U(S) is contained in M(S) (at 
least) g times.—If this is proved for each irreducible representation G(S), then 
we find that the regular representation of G is contained in, whence coincides 





2 Theorem remains true if we, as in Deuring (21), extend the underlying field F to an 
over-field F’ and consider K X F’, K, X F’ instead of K, K, . 
13 Stauffer (28). 





Bet 
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with, M(S) and thus we are through. *—That U(S) is contained in M(S) g times 
is equivalent to that there exists in K’ a matrix B with g columns, satisfying 


(5) BS = U(S)B forall Se@ 


and whose elements are all linearly independent (with respect to F’). Now, if 
we denote by B, the matrix consisting of the first g rows of a matrix B satis- 
fying (5), then B, fulfills B] = G(S)B, for all S." Furthermore, since the 
representation module of U(S) possesses a unique simple submodule (corre- 
sponding to G(S)), it follows easily that in order to show that all the elements 
of B are linearly independent we have merely to show that the elements of B, 
are linearly independent. And, in order to secure the latter we have in turn 
only to make sure that all the columns of B, are linearly independent (with 
respect to F’); this is due to the fundamental fact that every automorphism of 
an (irreducible) representation module of G(S) can be obtained by multiplica- 
tion with an element of F’ (cf. Deuring (21), pp. 43-44 and Stauffer (28), pp. 
592-593). Hence our aim is to prove the existence of B with g columns, satis- 
fying (5) and such that the g columns of B, are linearly independent with 
respect to F’. 

For this purpose, we can first apply Speiser’s theorem. (In a generalized form 
the theorem asserts: Let Mz, Ms,---,Mr and Nz, Ns,---,Nr (@ = 
{E, S,---, T}) be two systems of non-singular matrices with elements from 
K’ such that 


T —1 id —l1 
Ms M7rMsgr = Ng NrNsr 


for every pair S, 7’ «@. Then there exists a non-singular matrix C which satis- 
fies CSMsC"' = Ns for all Se@.)"* Namely, there is a non-singular square 
matrix C fulfilling CSC” = U(S) (that is, C’ = U(S)C) for all S. We choose g 
columns in such a C so that the square matrix Cy consisting of g’ elements at 
the intersections of those g columns with the first g rows is non-singular; this is 
obviously possible. Denote by B the matrix consisting of the chosen g columns 
of C. Then this B satisfies the above requirements, because (5) is evidently 
valid and the corresponding B, is simply C, whose g columns are linearly inde- 
pendent even with respect to K’. 





4 If a representation contains U(S) then U(S) is a direct constituent. See Nakayama- 
Nesbitt (11), §2. 

15 Here we agree that U(S) has already assumed the form where the right upper part is 0, 

16 This can be proved in exactly the same manner as in the special case Schur (27) (or 
Weyl (30)), provided that F’ contains sufficiently many elements. The case where this 
last condition is not satisfied can be reduced to a favorable case by an argument due to 
R. Brauer and E. Noether (cf. Deuring (20) or Van der Waerden (29), p. 70). 

The present form, which is more general than needed here and which is also more general 
than the one given in Deuring (21), for instance, has a significance for the classification of 
semi-linear transformations; cf. Nakayama (25), Haantjes (22). 
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A second method to obtain a relevant B is to consider a matrix 
B= Biz) = Dis O(S")e*, 2K, 


where by U(S') we denote the matrix composed of the first g columns of 
U(S”"). This B = B(z) certainly satisfies (5), and the corresponding B, is 
Biz) = }osG(S")z*. Now, B,(z) satisfies not only the relation B,(z)’ = 
G(S)B,(z) but also the relation 


B,(z") = B,(z)G(S). 


From this fact we can conclude that all the elements of B,(z) are linearly inde- 
pendent (with respect to F’) whenever one of them is different from 0, that is, 
whenever B,(z) ~ 0. But from the separability of K/F we can see easily that 
there exists certainly azeK such that Bi(z) #0. (For all this cf. Stauffer (28)). 
Thus B = B(z) with such a z possesses the desired properties. 

Furthermore, let G”, G®, .-.- ,G°” be the totality of distinct irreducible 
representations of @ in F’, and take for each G(S) = G™(S) an element z = 
z” (eK) satisfying the above condition. Here we can, and shall, choose these 


i ) . 4 é 
z™ in such a manner that 2” = 2” if G” and G” are conjugate with respect 


to F. Let now G,, Gz, --- ,G, be the totality of distinct irreducible repre- 
sentations of G in F, and put z = 2” if G, contains G“’. Let further E,, 
E,, --- , Ey be mutually orthogonal idempotent elements in the group algebra 


G(F) with the sum >> E, = E = 1 such that if N denotes the radical of @(F) 
then E, (mod N) is the unit element in the simple two-sided ideal of G(F)/N 
belonging to G,. (#,G@(F) is a direct sum of right ideals which define the 
component U, of the regular representation corresponding to G,.) Then w = 
>>. 2 “(e K) and its conjugates form a normal basis of K/F. This can be seen 
in quite a similar way as in Stauffer (28). 


CHAPTER II: FROBENIUSEAN RINGS 


4. Frobeniusean and quasi-Frobeniusean rings 


Consider a ring A. A may have a left, say, operator domain Q such that 
a(a + b) = aa + ab, a(ab) = (aa)b = a(ab) ifaeQanda,beA. Incase A 
satisfies both the minimum and the maximum conditions for left and right 
ideals (allowable with respect to Q), its general structure is well known. But 
recently C. Hopkins showed that the theory remains valid to a large extent in 
the case where A fulfills the minimum condition only.” We assume in the 
present treatment also merely the minimum condition. And, since we shall deal 
exclusively with such type of rings, we shall understand by a ring always a 
one of the type. 

A possesses the radical N which is nilpotent and the residue class ring 
A = A/N is semisimple. Everything in Part I, 1 remains valid® for our A 





17 Hopkins (23), (24). 
18 With trivial modifications of terminologies, of course. 
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except the second half of Lemma 1 and the statements concerning the com- 
pletely reducible ideals. Thus we retain the old significances of the symbols k, 
S(k), Cee Cx(= Cea), Ee = © i, E = iw Ex, Cx,i; (« = 1, 2,---,k; 
i,j = 1, 2,---,f(x)). Namely 


A = A, +A2.+--- + Ax, 


where each A, is simple and has a unit element E, = E, (mod N); &,; are 
mutually orthogonal idempotent elements, and (ii = Ce,i, Cx,i?,n1 = 
55 jrCx,i2. AS to the connection between r(N) and completely reducible left 
ideals, we can assert that any completely reducible left ideal is contained in 
r(N), while r(N) is a direct sum of a completely reducible left ideal and a left 
ideal which is annihilated by A; this last left ideal is not necessarily completely 
reducible this time.” 

Remark. If A possesses a unit element then it satisfies also the maximum 
condition for left and right ideals.” Moreover, the same assumption assures of 
course that r(N) is really the largest completely reducible left ideal. 

If mis a simple left module of A, then we denote by d,(m) the rank of m with 
respect to the quasi-field of automorphisms. Hence, if m ~ Aé = Ae,/Ne, 
then d;(m) = f(x). More generally, if m is a left module which has a com- 
position series m = m D m, D--- D ms DO, then we put dm) = 
>-j-1d:(m,1/m,). For a right module n possessing a composition series we 
define d,(n) in the same manner. 

Now, since a direct generalization of the definition of Frobeniusean and 
quasi-Frobeniusean algebras to our general case seems difficult, let us, in view 
of I, Lemma 2, define the corresponding types of rings as follows: 

DEFINITION. A is called quasi-Frobeniusean if it possesses a unit element 
and if there exists a permutation ((1), (2), --- , r(k)) of (1, 2, --- , &) such 
that for each x 

i) eA has a unique simple right subideal r, and tr, ~ @A, 

ii) Aes) has a unique simple left subideal t) and bo 2 A&. 

If moreover 

iii) f(x) = f(r(k)), 
then we call A Frobeniusean. 

Remark.” In the above definition we can omit the condition (a) 2 Aé, in 
ii). For, if the other conditions are satisfied, that is, if A has a unit element and 
if i) and the first part of ii) are the case, then ela = ed (N)er) 2 tery ¥ 0, 
whence necessarily by) 2 Aé. 


1° This is due to our not assuming anything about the structure of Q. 

20 See Hopkins (24). As a matter of fact, our main interest in the present paper lies 
in such an A which, either by its definition or as a consequence of its definition, possesses 
a unit element. Hence our present avoidance of maximum condition is not so important, 
according to this remark. 

2a See a correction at the end (Added in proof). 
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We have now the following theorems which correspond to Th. 1, 2 and 3 in 
Part I: 
THEOREM 6. If a ring A 1s quasi-Frobeniusean, then 


a) Ur) = = r(U(r)) = 


for every left ideal | and right ideal r. Conversely, if «) holds for every nilpotent 
simple left ideal { and nilpotent simple right ideal r as well as for = r = N 
(radical of A) and { = r = 0, then A is quasi-Frobeniusean. 

THrorEM 7. If A is Frobeniusean, then besides a) we have 


8’) dt) =d(A/r()), = d-(t) = di(A/U(r)) 


for every left ideal { and right ideal rv." Conversely, if «) is valid for every nilpotent 
simple left ideal {, nilpotent simple right ideal tr, = r = N and{ = r = N while 
8’) ts the case for every nilpotent simple left ideal { and nilpotent simple right 
ideal r, then A is Frobeniusean.” 

Proor. The second parts of the theorems can be proved in the same way 
as in I (with a very slight and trivial modification).” 

To prove the first parts, assume that A is quasi-Frobeniusean. Since l(N) 
is a two-sided ideal we have l(N) = >>, Ed(N) = Dox: @,(N). From the 
definition of a quasi-Frobeniusean ring it follows that for each x, 7 the right 
ideal ¢,,1(N) = e,:A fA UN) is simple and is isomorphic to @,)A. Hence 
ex,iU(N) = e,l(N) Ex) , whereas e,,1(N)E, = Oif X # w(x). Thus E,l(N)E, = 
E,U(N) or = 0 according as \ = x(x) or not. Since this is the case for every x, 
we have U(N)Ex« = >>, E,U(N)En®) = Ed(N)Ex = EJd(N). This shows 
that E,J(N) is two-sided. Moreover, it is a simple two-sided ideal. To see 
this, let d be any non-zero element in E,l(N). d = S49 e..d and at least 
one of ed is not zero. Suppose e,,,d ~ 0. Then e,,dA = e,,,l(N) since 
€x,p\(N) is a simple right ideal, and therefore AdA = Ae,,l(N) 2 EN). 
Hence the two-sided ideal E,J(N) is simple. In particular, it is completely 
reducible as a left ideal too, that is, E.l(N) € r(N). Because this is true for 
every x, it follows that (VN) € r(N). But the inclusion of the other direction 
can be seen in the same way, and we have I(N) = r(N). We denote this two- 
sided ideal by M. 

Let { be any non-zero left ideal of A and let {’ be a maximal left subideal of 1. 
Suppose [/l/ ~ Aé,. If ce A then the left ideal fe is isomorphic to 1/1 N U(c), 
as can be seen from the mapping b — be (bef). If in particular c e r(l’) then 
(Nic) DIN Ur(’)) DV whence tc ~ Aé, or = 0. This shows that Ir((’) & M. 
Let now b, be an element in e,{ which is not contained in [’; the existence of such 





*1 In particular, we have d;(A) = d,(A). 

2 It is justified to speak of d; and d, here, since the validity of a) for nilpotent simple 
ideals as well as for N and 0 implies that A is quasi-Frobeniusean and in particular that A 
satisfies, because of the existence of a unit element, the maximum condition too. 

28 One has to modify slightly and in an obvious manner the very last part of i) there, 
while the final step viii) becomes a mere triviality this time. 
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a b; follows from I/’ ~ Aé. Then Ab; Ul’ = [. From the above considera- 
tion we have br(l’) € e,fr(l’) € eM whence bir(l’) = eM or = 0. Thus 
br(l’) ~ @«@A or = 0. On the other hand bir(l’) = r(’)/r(’) A r(by) and 
here r(’) NM r(bis) = r(’) N r(Abi) = r(’ U Ady) = r(Q. Hence finally 
r(')/r() x» &m@A or = 0. 

Consider a composition series of left ideals =OChChC---Cl=A 
of A. The above observation shows that each right module r({)/r(la1) is 
either simple or 0. Since r(0) = A and r(A) = 0, it follows that the length 
of a composition series of right ideals of A is less than or equal to s, the length 
of a composition series of left ideals. But the inequality of the other direction 
can be seen in the same manner. Hence the lengths of composition series of 
right and left ideals are equal to each other, and moreover the right moduli 
r(l:)/r(ligs) (€ = 0,1, ---,s8 — 1) are all simple. Further, all the left moduli 
U(r(te41))/U(r()) must be also simple, and necessarily /(r({,)) = [for all ¢. Since 
there exists always at least on composition series through any given left ideal, 
we have l(r(f)) = I for any left ideal [. Similarly r(/(r)) = + for every right 
ideal r. That is, a) is always valid in A. This proves the first part of 
Theorem 6. 

Our proof shows further that if [// ~ Aé then r(l’)/r(l) ~ & A. And 


this fact, together with the relation d,(Aé,) = d,(&A ) = f(x), shows the validity 
of 8’) in case of a (not only quasi-Frobeniusean but) Frobeniusean ring A. 


5. Corollaries 


THEOREM 8. Let A be a quasi-Frobeniusean ring. The composition length of 
a principal left ideal Ac is equal to that of the principal right ideal cA. Ife = 
C1 + c2 and Ac ts the direct sum Ac = Ac, + Ace, then cA is the direct sumcA = 
aA + cA. Further, if A is not only quasi-Frobeniusean but Frobeniusean, then 
d(Ac) = d,(cA). 

Proor. Cf. I, Th. 4. 

THEeorEM 9. Let A be quasi-Frobeniusean, and let w(x) have the same signifi- 
cance as in the definition. If \' is a maximal left subideal of a left ideal { in A 
and if the left module {/(' is isomorphic to Aé, , then the right module r((’)/r(Q) is 
isomorphic to €,)A. The quasi-field of automorphisms of 1/\' is inversely iso- 
morphic to that of r(’)/r(). Furthermore, r(N”) = I(N”) for any v = 1, 2,--+ ; 
where N is, as before, the radical of A. 

Proor. The first assertion was shown in the proof of Th. 6. The second 
one is, by virtue of the first one, equivalent to the relation €A& ~ Er) Aer) ; 
and this in turn can be seen in the same way as in I, Th. 3, vii). The final 
statement follows from the special case r(N) = I(N), which was already estab- 
lished in the proof of Th. 6; See I, Th. 6. 

THEoREM 10. If in a ring A the relation a) holds for! = t = N (radical), 
1 = r = 0, and both a) and 8’) hold for all nilpotent simple two-sided ideals 

= r = 3, then A is Frobeniusean. 

Proor. Arguments of i), ii), iii) and iv) of I, Th. 7 can be transferred term 
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by term. Now, E.M(= E.ME,w) = oi) eiM, and here the f(x) right 
ideals e,,;M are mutually isomorphic and are direct sums of simple right ideals 
isomorphic to @ A. Hence d,(E,.M) = f(k)d-(€)A) = f(x)f(r(x)) and the 
equality sign holds if and only if eM are simple. But d,(E,.M) = 
d,(A/UE.M)) = d(A/N U A(E — E,)A)) = f(x)’ by our assumption.“ Thus 
f(x) 2 f((x)). Since this is the case for every x, necessarily f(x) = f(x(x)) 
and ¢,:M are simple. Similarly we find that Me,,; are simple left ideals. It 
follows then easily that A is Frobeniusean. 

THEOREM 11. Suppose that a) holds for every nilpotent simple two-sided ideal 
{= r = 3 as well as fort =r=N,t=r=0. Let moreover 


re d,(3) = d,(A/r(3)),  da(a) = di(A/UG)) 


for every nilpotent simple two-sided ideal 3. Then the ring A is Frobeniusean. 
Furthermore, the same is true when we replace the above y) by 


7’) di(3) = di(A/r(3)), — d-(3) = d-(A/1(3)) 
or by 
y”’) dr(3) = di(A/r(3)), — da(3) = d,(A/1(3)). 
Proor. Arguments in i), ii), --- , iv) of I, Th. 7 remain again valid. And, 


again d,(E,M) = f(k)d-(@r)A) = f(x)f(a(x)); the equality sign is true if and only 
if e,,,.M are simple. Moreover, d,(A/r(E,.M)) = d,(A/N U A(E — E,))A) = 
S(m(x))°, d-(A/UE.M)) = d,(A/N U A(E — E,)A) = f(x)’ andd,(A/r(E.M)) = 
f((x))’. Our assertions follow from these relations in quite a similar manner 
as above. 


6. Vector moduli and a theorem of M. Hall 


Let g be a natural number and consider a module % consisting of all g-dimen- 
sional vectors r = (a1, 22, --- ,2,) with components xz, from A. For a subset 
© of A we denote by R(S)[L(S)] the set of vectors » = (y:, ye, --- » Yy) Such 
that (the scalar product) (zx, y) = pa Lpyrl(y, Xr) = >, ¥ptp] = 0 for every 
re. If we take our theorem 6 into account, the first part of the following 
theorem is equivalent to the principal theorem (Th. 5. 2) in Hall (7): 

THEOREM 12. Let A be quasi-Frobeniusean. Then L(R(Q) = &%, R(L(M)) = 
R for every A-left-submodule % and A-right-submodule R in A. If moreover A 
is Frobeniusean then d() = d,(M/R(®)) and d(R) = di(A/L(R)). 

We want to show that we can derive this theorem also from our main theo- 
rems 6, 7, obtaining thus a second proof, though rather long, of Hall’s theorem. 
For that purpose, consider the g-rowed matrix ring B = A, = ) 2. ae EpgA 
over the quasi-Frobeniusean ring A; where €,, is a system of matrix units com- 
mutative with every element of A. 

Lemma 2. B is quasi-Frobeniusean. 





Cf. I, Th. 7, ii). 
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Proor. Evidently éppex,: (p = 1,2,---,g3;« = 1,2,---,k3; += 1,2,---, 
f(x)) form a system of mutually orthogonal idempotent elements whose sum 
is the unit element of B. Q = > €p,V is the radical of B and r(> EnV) = 
U( >> €ngV ) = }e,.M; where N is, as before, the radical of A and M is the two- 
sided ideal r(N) = l(N)in A. WeputP = )oepM. FromeM ~ er@A/eruN 
(with respect to A), it follows easily that €ppeP ~ €pperx)B/epperaoQ & 
€11€r(x)B/ener(e)Q with respect to B. Similarly Pepper, ~ Bene/Qene.. But 
this shows that B is quasi-Frobeniusean. 


Now, consider the A-left-module % = «xB = >>,e,A and the A-right- 
module % = Bey = >o,€n4. The mapping 6: r = (xp) > >> ep%p is an 
operator isomorphism of % and %, with respect to the left operator ring A. 
Similarly, 62: )» = (Yp) — > epyp is an operator isomorphism of % onto 
% with respect to the right operator ring A. Moreover, 6;(r)@e(y) = 
Dept pYp€n = (£,0)en. In particular 6;(r)62(y) = 0 if and only if (zr, ») = 0. 
Hence 6.(R(S))(S | A) is the set of right annihilators of 6:(S) in %,, in the 
sense of multiplication in B. That is, 


(6) 6(R(S)) = Ae N r(61(S)). 
Similarly 
(7) A(L(S)) = % A 1(42(S)). 


On the other hand, B({M %:) = { for every left ideal | in B, while B&, N % = 
%, for every (A-left-) submodule &% of Y. (Because {[ M %, = ex! whence 


Bil nN 1) = Bel = Be, Bi = Bi = I, and BS N , = €1B% aad 61 Bens scary 
Ae: = AG = &.) Similarly (rN Y%)B = rand R.BN A. = Re for every 
right ideal r in B and (A-right-) submodule 9: in %2. We have now 


A(L(R(2))) = % A 1(6(R(Q))) by (7) 
1, M U(Me N r(01(2))) by (6) 
WN U(X A r(Be(®))B) = %W N U(r(Ba(®)) 


%, A Bag) (since B is quasi-Frobeniusean) 
= 6:(&), 


whence L(R(%)) = ¥&. Similarly R(L(M)) = MR, and this proves the first part 
of the theorem. To prove the second part, one has only to notice that if A is 
Frobeniusean then B is so too, as can readily be seen, and that d,(B%,) with 
respect to B is equal to d)(%1)g with respect to A. 

Coroutuary.” Let G be a finite group and let @(A) be the group ring of & over A. 
If A is quasi-Frobeniusean then G(A) is so too. 





*5 This corollary, as well as the above lemma 2, is closely related to the next section. 
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Proor. Let G@ = {G,, Ge,---,G,}. Let & be a left ideal in @(A). We 
want to show that an element Y = > ¥pGp' in @(A) is a right annihilator of 
Y if and only if >i XpY¥p = O for all X = >» 2pG, in %; if we succeed in showing 
this and the corresponding fact for a right ideal, then our corollary is an im- 
mediate consequence of Th. 6 and 12. Now, since >> LpYy is the coefficient of 
the group unit element in the product XY, the “only if” part is trivial. To 
prove the “if” part, suppose that Y satisfies the above condition. Let X be 
an arbitrary element in 2. Then G;'X ¢& for all p = 1, 2,---,g. But the 
coefficient of G, in XY is equal to the coefficient of the unit element in G,'XY. 
It follows then that all the coefficients in XY vanish, that is, YY = 0. Hence 
gY = 0. 

Supplement for the case of an algebra. If A is a Frobeniusean algebra over a 
field F, then we can evidently replace d, and d, in the above theorem 12 by the 
dimension with respect to F. Furthermore, in this case we have the following 
generalization of Th. 2: 

THEOREM 13. Let A be a Frobeniusean algebra. Let &%, % be A-left-submoduli 
of the vector module A such that & > & , and denote the representations of A defined 
by the left module &/% and the right module R(%)/R(Q) by a—> M(a) and a — N(a). 
Then the first representation a — M (a) is equivalent to the representation a* — N(a), 
where a — a* is the automorphism of A given in Theorem 1. 

Proor. On retaining the notations of the section 1, we consider the linear 
function d((t, »)) = A(Doxpyp) of the scalar product (r, »). And, for a subset 
S in UA we denote by p(S)[q(S)] the set of y such that A((z, »)) = O [A((y, r)) = 
0] for all r¢S. It follows in quite a similar manner as before” that q(2) = 
R(2). And, the representation of A defined by the left module 2/% is equivalent 
to the one defined by the right module p(%)/p(%) (Cf. I, 8). Furthermore, 
yg: £ = (t») > r* = (x3) = (¢(zp)) is a 1-1 mapping of on itself (See Th. 1), 
and it is characterized also by the relation A((r*, y)) = A((v, x)) (for all y). 
Evidently q(S)* = p(S), whence R(L)* = p(2). Thus the representation of A 
defined by &/% is equivalent to the one defined by R(%)*/R(L)*, and this is our 
assertion. 


7. Product rings 


Suppose that our operator domain © of A contains a field” F and that every 
operator in F commutes with every operator in 2. Consider on the other hand 
a (finite linear) algebra B over F. By replacing in the usual manner the co- 
efficient domain F by A we obtain a ring C = B X A, which has @ as a domain 
of operators.” (Namely, if b,, bo, ---, bn (n = (B:F)) is a basis of B, then 





* Cf. I, 3. 

27 We naturally suppose that the addition and the multiplication in F coincide with 
those as operators and that the unit element in F is the identity operator. 

28 The conditions about the operator domain at the beginning of the section 4 are satis- 
fied with regard to 2 and C. Moreover, C fulfills the minimum condition for right and 
left ideals. 
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C = A + hA +.--- + b,A. To introduce the law of multiplication in C, 
we first define the product ba of b = >> tib:e B, ae A to be >; bi(éa) and 
then define (>> b,a:)(>> b,a;) to be >-;,;(bib;)(aia;). Further, a D) bia; = 
>> bi(aa;) for a € 2.) 

TuHreoreM 14. The ring C = B X A is quasi-Frobeniusean |Frobeniusean] 
if and only af both A and B are so. 

Proor. If {, and are left ideals in B and A respectively, the module product” 
il is evidently a left ideal in C, and 


(8) r(t) = Br(t) Ur()A 


as can readily be seen.” (Fora proof, take a basis b; , bz, --- , bn of B such that 
r(l) is spanned by 6,41, bsx2, --- , On (Ss = n — (r(h):F)). Suppose that an 
element c = >> b,a; in C is a right annihilator of the left ideal 1,{. Namely, if 
bel, ae then ba(>, bia;) = >> (bb,)(aa;) = 0. Let bbs = >>; Bi;(b)b;. 
Then > :,:0;(8:;(b)aa;) = 0, or 


(9) >: 6:(b)aa; = 0 forallj = 1,2,---,n. 


Here we notice that 6;;(b) = 0Oif7 = s+1,s+2,---,n. Now, let b run over 
a system of generators b", b®,.-., b” of ,. Then the s matrices B; = 
(8:;(b”)) jp (¢ = 1, 2,---, 8) are linearly independent, because otherwise a 
non-trivial linear combination of 6; , be , --- , bs would be a right annihilator of 
{,. In other words, the system of linear equations >-{-:2:8;;(b”) = 0 (j = 
1,2,---,n;p = 1, 2, --- , t) has no non-trivial solution, that is, the rank of 
the matrix (8;;(b” ));,;p (¢ = 1,2, ---, s)iss. But then ) 3-1 8;;(b™)aa; = 0 
(cf. (9))” implies aa; = 0 (i = 1, 2,---, 8). Since a is any element in [, this 
shows that a,, a2, ---,a,¢e7() andc = >, bia; = 2s ates + ys bia; € 
Br(t) U r()A. Thus r(&1) © Br(t) U r(l)A. The inclusion of the other 
direction is trivial.) 

a) Now let A and B be quasi-Frobeniusean. Let N, Q be the radical of A, 
B respectively, and put M = r(N) = UN), P = r(Q) = l(Q). The two-sided 
ideal BN U QA in C = B X A is evidently contained in the radical S of C and 
we have, according to (8), r(BN U QA) = r(BN) Nr(QA) = BMN PA = PM. 
Hence r(S) ¢ PM. 

Let g and e be any primitive idempotent elements in B and A respectively. 
There exist, since B and A are quasi-Frobeniusean, primitive idempotent 
elements g’, e’ in B, A such that Pg ~ Bg’/Qg’ (=~ Bg’ U Q/Q) with respect to 





29 The structure of C thus obtained is of course independent of the special choice of 
the basis b1 , be, «++ , bn. 

30 The product of an element in B with an element in A is defined as in the above eluci- 
dationof C=BX A. 

31 Here, as well as in the whole following proof, r(*) denotes the set of right annihilators 
of the argument in A, B or C according as the argument lies in A, B or C. 

32 Cf. also a remark following (9). 
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(the left operator ring) B and Me = Ae’/Ne’ (= Ae’ U N/N) with respect to 
A. Then, as can readily be seen, 


(10) PMge x Cg’e’'/(BN U QA)g'e'(= Cg’e’ U BN U QA/BN U QA) 


with respect to C. Now the following fact is more or less well known and is 
easy to see: 

Lemma 3. If in particular B and A are semisimple, then B X A is Frobeniu- 
sean; (it is in fact uni-serial (= einreihig) in the sense of Kéthe (9)*). 

On applying the lemma to the simple algebra Bg’B U Q/Q and the simple 
ring Ae’‘A UN/N (instead of B, A), we find that Cg’e’/Sg’e’ is isomorphic, 
with respect to the left operator ring C, to the largest completely reducible 
submodule of Cg’e’/(BN UQA)g’e’. But this latter is, because of (10) and 
r(S) & PM, isomorphic to r(S)ge, and therefore 


(11) Cg'e’/Sg’e’ = r(S)ge. 


Let ge =jit+jet+--- +jpandg’e’ = ji +j2 + --- + J, are decompositions of 
ge and g’e’ into mutually orthogonal idempotent elements in C. Then the left 
side of (11) is a direct sum of exactly q simple submoduli, while the right side is a 
direct sum of at least p simple submoduli.“ Hence p < g. On the other hand, 
the right moduli g’P, e’M of B, A are isomorphic to gB/gQ, eA/eN respectively 
and it follows in the same way as above that g < p. Thus q = p, and all the 
left ideals r(S)j: , r(S)je, --- , r(S)jp are necessarily simple. Moreover, they 
are isomorphic, up to their arrangement, with Cj:/Sji , Cj2/Sj2, --- , Cj,/Sj,- 
Similarly the right ideals j;1(S), j21(S), --- are simple and isomorphic to j;C/j,S, 
joC/j28, --- except their arrangement. 

If we apply this consideration to all pairs of primitive idempotent elements g 
and e appearing in a decomposition of the unit elements in B and A, then we find 
readily that C = B X A is quasi-Frobeniusean, and that it is even Frobeniusean 
if both B and A are so. 

(A second proof of the quasi-Frobeniusean part can be obtained in the follow- 
ing way: First let B be not only quasi-Frobeniusean but Frobeniusean. Then 
the parastrophic determinant of B is not identically zero, and an easy modifica- 
tion of the proof of the corollary in the preceeding section 6 shows that the 
relation a) holds in C = B X A, whence C is quasi-Frobeniusean. The case 
of a quasi-Frobeniusean B can be reduced to this Frobeniusean case by a device 
similar to that of Part I, 3.) 

b) Assume conversely that C = B X A is quasi-Frobeniusean. Then r(C) = 
0, whence r(B) = 0 and r(A) = 0 because of (8). Hence the same relation (8) 
shows that r(Bl) = Br({) and r(4,A) = r(li)A. If we combine these relations 
and their (left, right) duals with the relation a) in C, which is valid by our 


” 





38 Cf. also the section 9 below. 
34 Observe that r(S)j = r(S) N Cj is thelargest completely reducible left subideal of Cj, 
which is not 0 evidently. 
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assumption, then we find that a) is valid also in both A and B. Hence A and B 
are quasi-Frobeniusean. 

That if C is Frobeniusean then A and B are so can be seen from the analysis in 
the above a) which is, since A and B are quasi-Frobeniusean at least, now 
applicable. 

(It is also possible to prove the first part of this b) by a structural analysis 
similar to a). On the other hand, in case (not only B but) A is an algebra over 
F,, the second part of b) may be seen also by using (8) and the relation 8) (1, 
Th. 1) or 6’) in C.) 

We now prove the following supplement of Th. 12, Corollary: 

Coroutuary. Let A be Frobeniusean. Then the group ring @(A) is Frobeniu- 
sean too. 

Proor. The A-left-moduli A/N and M = r(N) = I(N) are isomorphic to 
each other. It follows easily. that the G(A)-left-moduli G@(A/N) (~G(A)/ 
(N)) and @(M) are isomorphic. In particular, the largest completely re- 
ducible @(A)-left-submodule of @(M) is isomorphic to that of G(A/N). But 
the latter is in turn isomorphic to the residue class module of G(A/N) with 
respect to its radical, because the group ring @(A/N) over the semisimple ring, 
A/N is Frobeniusean according to the above theorem 14.” Since G(N) is 
evidently contained in the radical S of G(A) and since @(M) contains the right 
annihilator r(S) of S in @(A), we find that the residue class module @(A)/S 
is isomorphic to the largest completely reducible left ideal of G(A), with respect 
to the left operator domain @(A). But we know already that @(A) is quasi- 
Frobeniusean at least. And, these two facts assure that @(A) is Frobeniusean. 


8. Residue class rings 


THEOREM 15. Let A be a Frobeniusean ring and let 3 be a two sided idealin A. 
The residue class ring A = A/3 is Frobeniusean if and only if the two-sided ideals 
r(3) and 1(3) are respectively a principal right and a principal left ideal: r(3) = bA, 
l(3) = Ae. 

Proor. Let {p} be the set of such indices that E,¢3. Since #, are idem- 
potent, E,¢3 UN either. (For, (2g + n)’ (23, ne N) is a sum of n’ with an 
element in 3, and n’ vanishes for a svfficiently high vr.) Denote the residue 
classes of E, , e,,; (mod 3) by E,, é,,:. MN = N U3/; is the radical of A, and 
A/N = > ,(4/N)E, is the (unique) decomposition of 4/N into a direct sum of 
simple two-sided ideals. é,,; are primitive idempotent elements in A, since 
Aé,,;/Né,,; are simple. Moreover, E,(r(3) MM) # 0 if and only if «  {p}, 
where M = r(N) = I(N) as before, because 1(r(3) NM) = U(r(; UN)) = 3 UN. 





35 We decompose first A/N into a direct sum of simple rings and then consider the group 
rings over those simple components. ‘Phey are Frobeniusean, because they can be con- 
sidered as products of those simple rings with the group algebras constructed over their 
centers, for instance. 
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On the other hand, M is a direct sum of simple two-sided ideals E.M = ME, 
and every two-sided subideal of M is, as can readily be seen, a direct sum of 
some E,M. Hence 


(12) r3) NM = 20,E,M(= DOME) = Dop.sMez.:)- 


a) Assume now r(3) = bA. We observe first that e,,.b ~ 0 if and only if 
xke{p}. The left ideal Ae,,,b is isomorphic to A/I(e,,:b) and here l(e,,:b) = 
U(e,,bA) = Ue, iA NbA)” = Ue,,A) UU(bA) = A(E — e,,) U3. That is, 
Ae,,ib ~ A/A(E — e,,:) U3 = Aé,,;. Moreover Ab = A/I(b) = A/; = A. 
Since Ab is the sum of Ae,,:b while A is the direct sum of Aé,,; , we find that Ab 
is indeed the direct sum of Ae,,:b.” Each Ae,,,b contains at least one simple 
left subideal, and therefore the largest completely reducible leit subideal Ab NM M 
of Ab is a direct sum of at least >>,f(p) simple left ideals. But AbNM © 
r(3) MM, since Ab © r(3), and here (3) MN M is the direct sum of >_,f(#(p)) = 
> f(p) simple left ideals Me,.,),; (See (12)). Hence necessarily AbN M = 
r(3) 1. M and each Ae,,;b has only one simple left subideal. Furthermore, since 
Ae, ab > --- > Ae, spb (for Aé,im-:-- =~ Aeé,,s(»)), it follows that there exists a 
permutation {v(p)} of {p} such that the unique simple left subideal of Ae,,,),;b is, 
for each p, i, isomorphic to Me, ,),; ~ Aé, ~ Aé,/Né,. Thus Aé,,,),; has also a 
unique simple left ideal, which is isomorphic to Aé,/Né,. We observe also 
that f(v(p)) = f(x(p)) = f() according to our construction. 

Assume further 1(3) = Ac. Then we find in the same way as above that 
every @,A has only one simple right subideal, and this simple right ideal is, by a 
remark in 4, necessarily isomorphic to @,,,)4/é ,N. A is therefore Fro- 
beniusean. 

b) To prove the converse, suppose that A = A/; is Frobeniusean. Let P 
be the annihilator ideal of the radical N in A, and denote by P the two-sided 
ideal in A consisting of those elements whose residue classes (mod 3) lie in P. 
We consider further r(3) and r(P) in A. The latter is the intersection of all 
maximal right subideals of the former, because P is the sum of all those left 
ideals in which 3 is a maximal left subideal. 

There exists, by definition, a permutation {v(p)} of {p} such that é,P = 
@npA (x ExpA/expN), Pep) ~ Ae, and f(r(o)) = f(p). We have E,P = 
PE,.,,, or, E,P U3 = PE,,) U3. Now, 3 coincides with the intersection 
Ne doe EP U3), p running over {p}. Hence 


(13) r(3) = Loot DoepepleP U 3); 


the summands are two-sided ideals since >..«,»H.P U3 are such. Furthermore, 
P/>o.4,BP U3; = E,P is, for each p, a direct sum of f(»(e)) = f(e) simple 
A-left-submoduli isomorphic to Aé, , and therefore r( > .«,..P U 3)/r(P) is a 





36 Observe that bA is two-sided. 
37 This follows also from Theorem 8. 
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direct sum of f(p) simple A-right-submoduli isomorphic to @,:,)A (Cf. Theorem 
9). But 


r(DouepLeP U3) = r(DoeenPEc Us) = r((PN Adi cere) Ua) 


14 
ey (r(P) U By)A) Nr) = r(P) U (ExpA Arg)” = Enor(s) Ur(P), 


whence 


r( Doxgr(p) EP 13) /r(P) = Exp)r(3) Ur(P)/r(P) -_ > (enp.0°() U r(P)/r(P)). 


Hence we find that the right moduli e,;,),77(3) Ur(P)/r(P) are simple and 
= Er (pA. 

- According to this fact, we now take for each p, 7(i = 1, 2, --- , f(p)) an element 

bp,i IN €y¢p),:7°(3)€x(p),i Which is not contained in r(P); observe that f(a(e)) = f(e) 

= f(v(p)). Then b,;A Ur(P) = ey,),e7(3) Ur(P). Moreover, if we put 

b= > , then be,x:,),; = b,,; , and therefore 


bA Ur(P) = Diy iler.ir(3)) Ur(P) = L(Exr(3)) Ur(P) = 7) 


(ef. (13), (14)). However, r(P) is the intersection of all maximal right subideals 
of r(3), as we observed before. Hence necessarily bA = r(3). 

Similarly 1(3) is a principal left ideal, 1(3) = Ac, and this completes the proof. 

THEOREM 16. Leta ring A satisfy (not only the minimum condition but) also 
the maximum condition for left and right ideals. In order that every residue class 
ring of A be Frobeniusean, it is necessary and sufficient that every two-sided ideal 3 in 
A can be expressed as 3 = Ac = cA (ce A). 

Proor. a) Assume that every residue class ring of A is Frobeniusean. Then 
in particular A is Frobeniusean, and the above theorem 15 tells that every two- 
sided ideal 3(= r(l(3)) is a principal right ideal 3 = cA. Moreover d,(3) = 
d,(A) — d,(A/3) = di(A) — d,(A/3) = d,(3), for both A and A/; are Frobeniu- 
sean.” But d,(;) = d,(cA) = d,(Ac) by Theorem 8. Hence d,(3) = d;(Ac). 
Since Ac € 3 we have Ac = 3. 

b) Assume next that the condition of the theorem is satisfied. The existence 
of a unit element in A can be seen in the same way as in I, Th. 10. Let 3 = 
Ac = cA. The left ideal 3 = Ac is isomorphic to A/l(c) = A/l(cA) = A/l(3), 
whence d,(3) = d,(A/l(3)). Since this is the case for every two-sided ideal, we 
have r((3)) = 3. For, U(r(l(3)) = 1) whence di(r(l(3)) = di(A/U(r(U(3))) = 
d(A/U(3)) = di(3), which, together with r(1(3)) > 3, implies r(1(3)) = 3. Simi- 
larly d,(3) = d,(A/r(g)) and l(r(3)) = 3. We find therefore, according to Theo- 
rem 11, that A is Frobeniusean. Our assertion that every residue class ring of A 
is Frobeniusean is now an immediate consequence of Theorem 15. 





38 Modular law. 

39 See the footnote 21, or Th. 8. 

4° Here we use the composition lengths, and that is the reason that we assumed the 
maximum condition too. 
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9. Appendix: Generalized uni-serial rings 

As a generalization of the notion of Kéthe’s uni-serial ring,” we introduce the 
following 

DEFINITION. We call A a generalized uni-serial ring, if A has a unit element 
and if every left ideal Ae as well as every right ideal eA generated by a primitive 
idempotent element e possesses only one composition series.” 

The connection of this notion to our study is that a ring in which every residue 
class ring is Frobeniusean, as in the case of Theorem 16, is certainly a generalized 
uni-serial ring, although the converse is not true. 

Now, consider a generalized uni-serial ring A. Let N’"' ¥ 0, N’ = 0, where 
N is, as before, the radical of A. Let (x) be, for each x, an integer such that 
e.N°” # 0, e,.N* = 0; hence p = Max,(o(x)). Since eA has only one com- 
position series, every e,N*/e.N** (i = 0,1, --- , o(x) — 1) is simple. We put 


e,N*/e,.N*** =~ bete,iA- 

Thus ¢,Neg¢,i) GN and if d is an element in e,N "Coie,i) NOt contained in N oe 
thendA =eN'. Letnowi+ j S o(x) — 1. Then e,N'*? = CN eg, AN’? = 
CcN'egix,N’. Hence 7 S a(y(x, t)) — 1. Moreover, since @g¢,9N’ = 
Ce, Coigix.) nA; we have we .. e.N**? a CN Co(x,1) NVC gie(«.i).5)A .. 
eN*egietx,i),9A-. Therefore 
(15) e(k, t + J) = o(y(k, 2), J). 
We put similarly N"e, & 0, Ne, = 0 and 

N‘e,/N**e, = A Cy («.i) 
for? S r(x) — 1. Then 
(16) ¥(x,t + 7) = Wk, 2), J) 


provided 7 + j S r(x) — 1. Further, if 7 S r(x) — 1 then? S a(x, z)) — 1 
and (W(x, 7), 2) = x, because eye,» N'e, GN’. Hence, if j < r(x) — l,i s 
a(W(k, j)) ar 1, then 


elolW(x, J), 9), 4 — J) = elk, t — j) or 
e(W(Y(x, 7 mo 1), 1), z) = V(k, J ial 1) 


according as i = j ori Sj.” 

After this preparation we now proceed in a similar manner as in Kéthe (9). 
Suppose that a left module I of our A is a (restricted) direct sum > M, of 
(an arbitrary number of) submoduli I, and that each M, is homomorphic to 


(17) e((«, J), 2) = 





‘1 Kéthe (9). 

#2 See the footnotes 40 and 41 of Part I. 

‘We obtain a similar formula by interchanging o, ¢ with r, y. In particular, 
o(v(x, J), 7) = Welk, 7), j) provided j S r(x) — 1,7 S o(¥(x, j)) —1,i So(x) -—1,9 8 
r(y(x, 7)) — 1. 
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one of Ae,. Let I, be homomorphic to Aes) and let es) correspond to u, in 
M, ; we have M, = Au, = Aexs)Us. Denote further the composition length of 
M,. by i(s). Then M, ~ Aees)/N eco), and N“?u, ¥ O while N“u, = 0. 
Let m be the smallest of all z(s). 

Lemma 4. If u (40) ts an element in eM such that the composition length | of 
Au is not greater than m, then there exists an element v in egixm—yDt such that 
uee,N” ‘v and the composition length of Av is exactly m. 

Proor. Suppose for instance u = ayu; + Gee + --- + ae , where a, € Aegis) 
anda, ¥ 0(s = 1,2,--- ,t). Since wu = eu = eau + --- + eeu: , We Can, 
and shall, assume a, € ¢Aéxs). Let dy € Nexis) but ¢ Nes). Then 


(18) k = H(«(s), U(s)), —_«(s) = ox, US), 


and Aa, = N' es). On the other hand N’u = 0 and N’“u %& 0. From 
N’aus = N’Aaus = N* eu, = N7*"u, (s = 1, 2, --- , 8), it follows that 
1+ U(s) = z(s) for all s = 1, 2, --- ,¢ while 2 + U(s) S (whence =) 7(s) for at 
least one s. 

Take an element d such that deeN” “yiem—y but ¢ N”"“"’. Then“ a, « 
ON ees) = ON NO = dN'®-™*"e., whence a, = db, with 
bs € em” "tine . Here e(e(k, aglines l), l(s) a i l) o(k, m—l + 
l(s) — m + 1) = o(k, l(s)) = «(s) according to (15), (18), and Ab, = N'©~"*e,¢4) . 

Put now v = byw, + deus + --- + bu. Then u = dv and eyemyv = v. 
N”v = 0, because I + I(s) = i(s) whence Nb, GN"N'®~"*"e,..) = N' Me) = 
0(s = 1,2,---,#. But N”’v ¥ 0, since N” db, = N’*e,.) and U(s) + 
l S 2i(s) for at least one s. Hence the composition length of Av is exactly m. 

Lemma 5. Let Dt be the same as above. Suppose moreover that IN is contained 
in an over (A-left-) module m and that m is generated by IN and a cyclic module M’ 
which is homomorphic to one of Ae, and whose composition length does not exceed m. 
Then m is a direct sum M + M’”’ of M and a second module M’’ homorphic to M’. 

Proor. Let Dt’ = Au ~ Ae,/N*e,, u = eu, and put U = MNM’. If 
ll = 0, then we are already through. Suppose Ul ¥ 0 and let U = N’u = 
N'e,u ; the composition length of Wis then hk — 1. Take an element c in eyy,yN’e, 
not contained in N’*’. Then cu el and Acu = N'u = U. But Acu is homo- 
morphic to Aéy«,. . Hence there exists, by the above lemma 4, an element v 
in M such that v = ev, Av ~ Ae/N™e, and cu €eyu,yN” “—”v, where 
e((x, 1), m — (h — 1)) = g(x, m — h).° But eye nN ey a eyieyN'N™ “ey 
cN”“*e, = ce,N” *e,. Therefore cu = cc’v with an element c’ ine,N”™ “e . 

We put now wu” = u—c'v. Obviously m = MU Aw’. Moreover, e,w’’ 
eu — ec’'v = u — cv = wv’, whence Au” is homomorphic to Ae,. Since cu’”’ 
cu — cc'v = 0, we have N'u’’ = Acu” = 0. Furthermore MN Au” = 
that is, m = I+ Aw”. For, if aw” = we Mae Ae,), then au — ac’v = 
au=wtacveM. Henceaue MN Au = U, and therefore a e N’e, , au’’ = 0. 
This completes our proof. 





44 Observe that 1 + l(s) = i(s) 2 m. 
45 Cf. (17). 
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Now, the following theorem, which is an extension of the main theorem in 
Kéthe (9), follows readily from the above lemma: 

TurorEM 17. Let A be a generalized uni-serial ring. Then every left module 
of A is a (restricted) direct sum of cyclic submoduli, cach of which is homomorphic 
to one of Ae,. The similar situation prevails for a right module of A. 

Remark. In the special case where A is, as in Part I, an algebra, the converse 
is also true. Namely, an algebra which possesses the property of Theorem 17 is 
necessarily a generalized uni-serial algebra. For, if a left module N' 'e/N’‘e, 
where e is a primitive idempotent element, is not simple, then a right module 
which belongs to a representation of A equivalent to the one defined by the left 
module Ae/N‘e is homomorphic to none of e,A, whereas it is certainly directly 
indecomposable.” 

Added in proof (Oct. 22, 1940): The remark adjoining the definition of Fro- 
beniusean rings (Chapter IT, 4) ought to have been placed after Theorem 6. 
In the proof of the theorem we showed l(N) © r(N) using only the condi- 
tion i), and from this follows the relation e,r(N) 2 e,l(N) = x, used in the 
remark. 

Asano (2), Theorem 2 and our Theorem 16 express essentially one and the 
same fact (Cf. Part I, Theorem 10 too). Namely, a ring whose residue class 
rings are all Frobeniusean is uni-serial. Further, the converse of Theorem 17 
is valid generally. For all this cf. the writer’s Note on uni-serial and generalized 
uni-serial rings, Proc. Imp. Acad. Tokyo Vol. XVI (1940) p. 285. 
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UBER DIE TOPOLOGIE DER GRUPPEN-MANNIGFALTIGKEITEN 
UND IHRE VERALLGEMEINERUNGEN 


By Heinz Hopr 
Received* by Compositio Math., August 23, 1939 
Einleitung.’ 
1. In der geschlossenen und orientierbaren Mannigfaltigkeit M sei eine 
“stetige Multiplikation” erklairt, das heiSt: jedem geordneten Punktepaar (p, q) 


von M ist als “Produkt” ein Punkt pg von M zugeordnet, der stetig von dem 
Paar (p, g) abhingt. Setzen wir 


pq = 1,(9), 


so ist l, bei festem p und variablem q eine Abbildung von ™ in sich; die Abbil- 
dungen |, hingen stetig von dem Parameter p ab, und sie haben daher alle den 
gleichen Abbildungsgrad c;. Analog ist der Grad c, der Abbildungen r, be- 
stimmt, die durch 


pq = r,(P) 
gegeben sind. 


Definiert man etwa die stetige Multiplikation so, daf pq fiir alle (p, q) ein 
fester Punkt von M ist, so ist c, = c, = 0; setzt man pg = p oder setzt man 
pq = q fiir alle (p, q), so ist c, = 0, c, = 1 baw. c, = 1,¢, = 0. Diese trivialen 
stetigen Multiplikationen sind in jeder Mannigfaltigkeit méglich; dagegen kann 
man, wie sich zeigen wird, nur in sehr speziellen Mannigfaltigkeiten stetige 
Multiplikationen so definieren, daf 


Cc, ~ 0 und cy FO 


ist. Eine Mannigfaltigkeit,’ welche eine solche Multiplikation zula£t, soll eine 
l-Mannigfaltigkeit hei®en.”* 

Der Begriff der '-Mannigfaltigkeit ist eine Verallgemeinerung des Begriffes 
der Gruppen-Mannigfaltigkeit; ist nimlich M eine Gruppen-Mannigfaltigkeit, 
d.h. ist in M eine stetige Multiplikation erklirt, welche die Gruppen-Axiome 
erfillt, so ist fir den Punkt e, welcher die Gruppen-Eins darstellt, sowohl die 





* Editors’ Note. This paper was originally submitted to Compositio Mathematica, 
August 23, 1939. It was transferred to the Annals of Mathematics (received November 18, 
1940) after the Compositio Mathematica ceased publication. 

1 Kine kurze Ankiindigung dieser Arbeit ohne Beweise ist in den C. R. 208 (1939), 1266- 
1267, erschienen. 

2 Unter einer ‘“‘Mannigfaltigkeit’’ ist in dieser Arbeit immer eine geschlossene und orien- 
tierbare Mannigfaltigkeit zu verstehen. 

* Die Giltigkeit des assoziativen Gesetzes wird also nicht gefordert. 
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Abbildung I, als auch die Abbildung r, die Identitit von M, und daher ist 
q(j=- Gc = e 

Somit gelten alle Sdize, die fiir T-Mannigfaltigkeiten bewiesen werden, insbe- 
sondere fiir geschlossenen Gruppen-M annigfaltigkeiten.° 


2. Wir werden Homologie-Eigenschaften von Mannigfaltigkeiten unter- 
suchen; dabei soll als Koeffizientenbereich der Kérper der rationalen Zahlen dienen.* 
Wie iiblich fassen wir die Homologieklassen einer Mannigtaltigkeit M 2u dem 
Homologie-Ring %%(M) zusammen: in ihm ist die Addition die der Bettischen 
Gruppen, und die Multiplikation ist durch die Schnitt-Bildung erklirt. Infolge 
der Benutzung rationaler Koeffizienten entgehen uns zwar gewisse Feinheiten 
der Struktur von M, so die etwa vorhandene Torsion; immerhin stimmen zwei 
Mannigfaltigkeiten M,, M2, deren rationalen Homologie-Ringe %(M,) und 
§ (M2) einander dimensionstreu isomorph sind, in den wichtigsten algebraisch- 
topologischen Eigenschaften tiberein, insbesondere in den Werten der Bettischen 
Zahlen. 


Unser Hauptziel ist der Beweis des folgenden Satzes: 
Satz I. Der Homologie-Ring R(T) einer T-Mannigfaltigkeit T ist dimensions- 
treu isomorph dem Homologie-Ring R(II) eines topologischen Produktes 


oe Keak. XR, et 


in welchem Sm die m-dimensionale Sphdre bezeichnet und alle Dimensionszahlen 
m1, M2, +--+ , m, ungerade sind. 


3. Da man die Struktur der Ringe (JJ) vollstandig tibersieht, kann man den 
Inhalt des Satzes I auch durch eine ausfiirhliche Beschreibung der Struktur der 
Ringe R(T) ausdriicken. Hierfiir machen wir noch die folgenden terminolo- 
gischen Bemerkungen: 

Der Ring %(M) einer beliebigen n-dimensionalen Mannigfaltigkeit” M enthilt 
ein Eins-Element: es wird durch den orientierten n-dimensionalen Grundzyklus 
von M dargestellt; wir bezeichnen es durch 1. Die Dimension eines Elementes 
z von #(M) nennen wir d(z); daneben betrachten wir hiiufig die “duale Dimen- 
sion” 6(z) = n — d(z). Unter einer “vollen additiven Basis” von #(M) vor- 
stehen wir die Vereinigung von Homologie-Basen der Dimensionen 9, 1, --- , ”. 

Nun 1a8t sich der Satz 1 folgendermafen aussprechen: 





3 Die Topologie der Gruppen-Mannigfaltigkeiten wird in den folgenden beiden Schriften 
von E. Cartan behandelt: (a) La Théorie des Groupes Finis et Continus et V Analysis Situs 
[Paris 1930, Mémorial Sc. Math. XLII]; (b) La Topologie des Groupes de Lie [Paris 1936, 
Actualités Scient. et Industr. 358; sowie: L’Enseignement math. 36 (1936), 177-200; sowie: 
Selecta, Jubilé Scientifique, Paris 1939, 235-258]. 

4 Tatsaichlich werden wir von dem Koeffizientenbereich nur benutzen, daf er ein Kérper 
der Charakteristik 0 ist. 
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Satz I (2. Fassung). Aus dem Ringe R(T) einer T-Mannigfaltigkeit T lassen 
sich Elemente 2, , 22, --- , 2 so auswdhlen, da’ die 2' Elemente 
1 : Zi : Ziy-2ig (11 < te) ; 
Zi, * ig? Zi (43 < t < 73) : bre : Zi-Ze: +++ By 
eine volle additive Basis bilden; die Multiplikation in R(T) ist durch das distribu- 
tive Gesetz, das assoziative Gesetz und die antikommutative Regel 
2p 25 = 252; 
— in welcher speziell die Regel 
zz; = O 


enthalten ist—vollstindig bestimmt; kurz: R(T) ist der Ring der (inhomogenen) 
Multilinearformen in den antikommutativen Groen 2; , 2, --- , 2: mit rationalen 
Koeffizienten. Alle z; sind homogendimensional,’ und ihre dualen Dimensionen 
5(z;) = m,; sind ungerade; die Dimension von T ist 


n=m +m+--- +m, 
und allgemein ist 
d(Z;, + Ziz: ae 24.) =n — Mi, — Mm, — --- — Mi, . 


Daf hierdurch gerade das Bestehen eines dimensionstreuen Isomorphismus 
zwischen R(T) und M(I1) ausgedriickt wird, erhellt aus der folgenden Tatsache, 
die man leicht bestiatigt: bezeichnet man den orientierten Grundzyklus von 
S» selbst mit S,, und mit p immer einen einfach gezihlten Punkt, so besitzen - 
in R(I1) die Elemente 


Z: = p X Sm, X Sm; X Pas X Se, , 
(*) Ze = Sm, X p X Sms X +++ K Sm, , 


Zi = Sm, X Sn. X Sn; X es xX p 


genau die analogen Eigenschaften, welche wir in (I) soeben den Elementen 
21, 22, +--+, 2 zugeschrieben haben. 


Aus der zweiten Fassung des Satzes I liest man unter anderem die folgende 
bemerkenswerte Eigenschaft der '-Mannigfaltigkeiten ab: 
Satz Ib. Jedes homogen-dimensionale Element z von R(T), fiir welches die 





° Kin Komplex K hei8t homogen-dimensional von der Dimension r, wenn jedes Simplex 
von K auf einem r-dimensionalen Simplex von K liegt. Ein Element des Homologie- 
Ringes (M) hei&t homogen-dimensional, wenn es in M durch einen homogen-dimen- 
sionalen Zyklus reprisentiert wird. Die additive Gruppe von #(M) ist die direkte Summe 
der Gruppen der homogen r-dimensionalen Elemente mit r = 0, 1, --- , n. 
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duale Dimension 6(z) gerade und positiv ist, la&t sich durch Multiplikation und 
Addition aus héherdimensionalen Elementen erzeugen. 


4. Der Satz I enthalt weitgehende Aussagen iiber die Bettischen Zahlen einer 
r-Mannigfaltigkeit. Unter dem “Poincaréschen Polynom” eines Komplexes 
K verstehen wir das Polynom 


Px(t) = po + pit + pol + --- 


in einer Unbestimmten t, wobei der Koeffizient p, die r-te Bettische Zahl von 
K ist. Fir die Sphiare S,, ist 
Ps (t) = 1+ 0"; 


bei der Bildung des topologischen Produktes K, X K: zweier Komplexe K,; und 
K, gilt nach der Formel von Kiinneth’ die Regel 


Prxx,(t) = Pr,(t)-Px,(); 


daher ist in dem Satz I (Nr. 2) der folgende Satz enthalten: 
Satz I’. Das Poincarésche Polynom einer T-Mannigfaltigkeit T hat die Gestalt 


(1) Pr) = (1+ 0) + 0). (+ "4, 


wobet alle Exponenten m; ungerade sind. 

Wir heben einige der zahlreichen Beziehungen zwischen den Bettischen 
Zahlen hervor, die sich aus (1) ablesen lassen: 

(a) Die Eulersche Charakteristik ist 0;' 

denn die Charakteristik eines Komplexes K ist gleich Px(—1). 

(b) Die Summe der Bettischen Zahlen ist eine Potenz von 2;° 

denn diese Summe ist fiir einen Komplex K gleich Px(+1). 

T sei n-dimensional; dann ist p, = 1, p, = 0 fiirr > n, also n der Grad von 
Py(t) und 

m + m+ --- +m = Nn. 


Da der r-te Koeffizient des Polynoms (1) offenbar nicht gréfer ist als der r-te 
Koeffizient des Polynoms 


AQ+)*=(14+0"-(1+)™".---(1+0™, 
so sieht man: 


(c) Es ist p, < (”) fir alle v2 


6 Alexandroff-Hopf, Topologie I (Berlin 1935), 309, Formel (13’). 

7 Falls die Abbildungen 1, und r, topologisch sind—also insbesondere, falls [ eine 
Gruppen-Mannigfaltigkeit ist—, wird fiir p: ¥ p2 durch f(q) = lp;lp2(q) eine Abbildung von 
r auf sich erklart, welche sich stetig in die Identitat deformieren la8t und keinen Fixpunkt 
besitzt; dann folgt der obige Satz aus einem bekannten Fixpunktsatz. 

5 Fiir Gruppen-Mannigfaltigkeiten: Cartan (b), 24. 

* Fir Gruppen-Mannigfaltigkeiten: H. Weyl, The classical groups [Princeton 1939}, 
279, als Korollar eines Satzes von Cartan (cf.!*). 
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Man kann (1) in der Form 
(1’) Ppt) =(1+0)"-14+F)*- 1+ &)%.--, 


schreiben; Ausrechnung ergibt 


Q”) Prt) =1+he+ (Bye + (1 + (i) )i + (ui + ())e donee 


Ks ist also 
(2) yh = l, ° 


Da nun der Koeffizient von ¢’ in dem Produkt (1’) offenbar nicht kleiner ist als 
in dem Faktor 


I+a%=(1+0", 


so gilt: 
(d) Es ist p, = (*) fir alle r.° 
Nach (1’) ist 
l + 3ls + 5b, + --- =n, 
also nach (2): 
~A = n — 3ls — 55 — --- 


daher lat sich (c) fiir r = 1 verschirfen: 
(e) Es ist entweder p, = n oder py = n — 3 oder px S n — 5." 


Ferner liest man aus (1””) und (2) die folgende Verschirfung von (d) fiir r 
ab: 


(f) Es ist py, = (=). 


also speziell: 

(fo) Ist py = 0 oder p, = 1, so ist pp = 0." 
Ebenso sieht man aus (1’’) und (2): 

(g) Ist py = 0, so ist auch py = 0. 


Man kann ohne Miihe noch eine Reihe ahnlicher Relationen feststellen, 2.B. 
die folgenden: 





10 Fir Gruppen-Mannigfaltigkeiten wie’; fir wesentlich allgemeinere Raume mit 
stetiger Multiplikation: W. Hurewicz [Proc. Akad. Amsterdam 39 (1936), 215-224]. 

11 Die Relation p, S n wurde fiir verallgemeinerte Gruppenriume zuerst von P, Smith 
[Annals of Math. (2) 36 (1935), 210-229] und dann von Hurewicz als Korollar aus dem 
unter!® zitierten Satz bewiesen. 

12Wegen der zweiten Bettischen Zahl einer Gruppen-Mannigfaltigkeit vergl. man 


Cartan’, (b), 14 und 23-24. 
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(h) Es set p, = 0; dann ist 
3p3 + 5ps + 7pr 


IA 


n, 


Pir = ‘ ‘ fiir « = 3, 5, 7 und belieliges r. 

5. Fir Liesche Gruppen kann der Satz I, auf Grund von Siatzen, die wir E. 
Cartan und G. de Rham verdanken, aus der Sprache der Homologie-Theorie 
in die Sprache der Theorie ddr invarianten Differentialformen iibersetzt werden.” 
Ich begniige mich mit der Formulierung des Ergebnisses, im Anschlu® an die 
2. Fassung des Satzes I (Nr. 3):"* 

Die Mannigfaltigkeit G reprdsentiere eine geschlossene Liesche Gruppe. Dann 
kann man aus der Gesamthett der Differentialformen, welche in G invariant gegeniiber 
den Operationen der Gruppe sind, Formen 


@1,W2,°** ,@, 


deren Grade 


mm, M2,-++ , Mm 


seien, so auswdhlen, da sie die folgenden Eigenschaften besitzen: 
1) Fiir jedes r bilden diejenigen duberen Produkte 


Wi, *Wig* eee "Wi, ; 
fiir welche 
mi, +m, +--- +m, =7, yu<h<-e+ Kt 


ist, eine lineare Basis (in Bezug auf konstante Koeffizienten) der invarianten 
Differentialformen des Grades r; 

2) alle m; sind ungerade; 

3) es ist m + m + --- + m, gleich der Dimension von G. 

Hierin ist unter anderem die folgende Tatsache enthalten, die dem Satz Ib 
(Nr. 3) entspricht: 

Jede invariante homogene Differentialform geraden Grades la&t sich aus in- 
varianten Differentialformen kleinerer Grade durch dubere Multiplikation und 
Addition erzeugen. 


6. Auch bei Beschrinkung auf Liesche Gruppen sind, soweit ich sehe, sowohl 
der Satz I als auch der schwichere Satz I’ neu. Allerdings waren diese Siatze 
bereits fiir cine so grofe und wichtige Reihe von Spezialfillen bekannt, daf 
ihre Giiltigkeit fiir beliebige Liesche Gruppen vermutet werden konnte. L. 





13 E. Cartan, Sur les invariants intégraux . . . [Annales Soc. polonaise de Math. 8 (1929), 
181-225 (= Selecta, 203-233)]; G. de Rham, Sur I]’analysis situs . . . (Journ. de Math. 10 
(1931), 115-200].—Man vergl. auch H. Weyl, a.a.O.°, 276 ff. 

14 Hier muB als Koeffizientenbereich der Kérper der reellen Zahlen dienen; man vergl.* 
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Pontrjagin, R. Brauer und C. Ehresmann haben nimlich, mit verschiedenen 
Methoden, die Bettischen Zahlen derjenigen einfachen geschlossenen Lieschen 
Gruppen bestimmt, welche den vier grofen Klassen in der Aufzihlung von 
Killing-Cartan angehéren, und diese Methoden liefern nicht nur den Satz I’, 
sondern auch den Satz I fiir die genannten Gruppen.” 

Ausgehend von diesem Resultat kénnte man wohl folgendermafen zu einem 
Beweis des Satzes I fiir alle geschlossenen Lieschen Gruppen gelangen: Man 
verifiziere die Giiltigkeit des Satzes auch an den fiinf einfachen geschlossenen 
“Ausnahme’’-Gruppen in der Killing-Cartanschen Aufzihlung; dann iibertrage 
man den—nunmehr fiir alle einfachen geschlossenen Lieschen Gruppen be- 
wiesenen—Satz auf alle geschlossenen Lieschen Gruppen, indem man die, aus 
der Cartanschen Theorie bekannte, Rolle ausniitzt, welche die einfachen Gruppen 
als Bausteine beliebiger Gruppen spielen. 

Aber ganz abgesehen von der Frage, ob die direkte Bestatigung des Satzes 
an den fiinf Ausnahme-Gruppen wirklich gelingt, wiirde ein solcher Beweis aus 
zwei Griinden nicht vollstandig befriedigen. Erstens wiirde er so umfangreiche 
und tiefgehende Teile der Theorie der kontinuierlichen Gruppen als Hilfsmittel 
verwenden, daf} dieser Aufwand in keinem rechten Verhaltnis zu dem elementar- 
topologischen Charakter des Satzes selbst stiinde. Zweitens wirde ein solcher 
Beweis in einer Verifizierung gipfeln; somit wiirde er zwar besonders konkrete 
Aufschliisse iiber diejenigen speziellen Mannigfaltigkeiten liefern, an denen die 
Verifizierung stattfindet—also iiber die Mannigfaltigkeiten der einfachen 
geschlossenen Lieschen Gruppen—, es wiirde aber wohl doch der Wunsch nach 
einem Beweis offen bleiben, welcher allgemeine Grinde fiir die Giiltigkeit des 
Satzes erkennen lieBe.”® 

Daher glaube ich, daf selbst dann, wenn die direkte Verifizierung des Satzes 
I an den fiinf Ausnahme-Gruppen und damit ein anderer Beweis fiir alle ge- 
schlossenen Lieschen Gruppen gelingt, doch unser Beweis, welcher fiir alle [-Man- 
nigfaltigkeiten gilt und infolgedessen aus der Lieschen Theorie nichts benutzt, 
auch fir die Lieschen Gruppen-Mannigfaltigkeiten willkommen ist. 


7. Andererseits wei man, da der Satz I gewisse Verschirfungen erlaubt, 
wenn man sich auf Gruppen-Mannigfaltigkeiten beschrankt; dann unterliegen 
nimlich die Zahlen m; , die im Satz I auftreten, gewissen Gesetzen; so folgt aus 
Siitzen von E. Cartan: entweder sind alle m; = 1 — dann ist die Gruppe 
Abelsch—, oder wenigstens ein m; ist gleich 3. Diesen Satz oder dhnliche Satze 





16 T,. Pontrjagin [C. R. Acad. Se. U.R.S.S. 1 (1935), 483-437 und C. R. Paris, 200 (1935), 
1277-1280].—R. Brauer [C. R. 201 (1935), 419-421] (man vergl. auch H. Weyl, a.a.O.°, 
232 ff.).—C. Ehresmann [C. R. 208 (1939), 321-323; 1263-1265]. 

16 Cartan’, (b), 26: ‘‘... Mais méme en nous bornant A la simple détermination des 
nombres de Betti des groupes simples, on ne devra pas s’estimer complétement satisfait 
si on arrive A faire cette détermination pour les cing groupes exceptionnels. .. . Il faut 
espérer qu’on trouvera aussi une raison de portée générale expliquant la forme si particu- 
liére des polynomes de Poincaré des groupes simples clos.”’ 

17 A.a.O.3: (a), 42-43; (b), 24. 
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mit unserer Methode zu beweisen, welche immer alle I'-Mannigfaltigkeiten 
gleichzeitig behandelt, ist prinzipiell unméglich; denn fiir T-Mannigfaltigkeiten 
unterliegen die m; iiberhaupt keiner Einschrinkung; es gilt nimlich der folgende 
Satz: 

Satz Il. Jedes Sphdren-Produkt 


ab lh” ae” a 


IV 


Bs 


in welchem die Dimensionszahlen m,, m2, ---,m, ungerade sind, ist eine 
l-Mannigfaltigkeit. 

Aus diesem Satz geht hervor, dai der Begriff der T-Mannigfaltigkeit nicht 
nur seiner Definition nach, sondern auch tatsiichlich viel allgemeiner ist als der 
Begriff der Gruppen-Mannigfaltigkeit: nach Satz II sind alle Sphiiren S24; 
T-Mannigfaltigkeiten, wihrend nach einem bekannten, socben erwaihnten Satz 
von Cartan unter allen Sphiren S,, allein S; und S; Gruppenraume sind. 


8. Die Aufgabe, diejenigen Ringe aufzuzihlen, welche als Homologie-Ringe 
von I-Mannigfaltigkeiten auftreten, ist durch die Sitze I und IT vollstindig 
gelost. 

Die Giiltigkeit des Satzes I] wird rasch im §1 durch direkte Angabe geeigneter 
stetiger Multiplikationen bestitigt. 

Im §2 werden Erzeugenden-Systeme beliebiger Homologie-Ringe betrachtet. 
Im Rahmen dieser Betrachtung wird der Satz I in zwei Teile zerlegt—Satz la 
und Satz Ib, von denen wir den zweiten schon in Nr. 3 ausgesprochen haben. 
Im Satz Ib (Nr. 15) tritt der Begriff des “maximalen” Elementes eines Ho- 
mologie-Ringes auf, der auch fiir andere Zwecke als unseren gegenwirtigen 
wichtig und brauchbar sein diirfte; wir werden sogleich noch auf ihn zuriick- 
kommen (Nr. 9). 

Der Ansatz zum Beweis der Sitze Ia und Ib, und damit des Satzes I, ist der 
folgende: man fasse die Punktepaare (p, g) von M als die Punkte p X gq der 
Produkt-Mannigfaltigkeit M x M auf; durch eine stetige Multiplikation pq 
in M, wie wir sie in Nr. 1 erklart haben, ist dann eine stetige Abbildung F von 
M X M in M bestimmt: F(p X q) = pq; diese Abbildungen F sind mit Hilfe 
des ‘Umkehrungs-Homomorphismus” zu untersuchen. Entsprechend diesem 
Ansatz werden zunichst im §3 einige einfache Eigenschaften des Ringes 
R(M X M) zusammengestellt; sodann wird im §4, nachdem an seinem Anfang 
kurz an die Theorie des Umkehrungs-Homomorphismus erinnert worden ist, 
der Beweis der Sitze Ia und Ib gefiihrt. 


9. Der schon erwihnte Begriff des maximalen Elementes ist der folgende: 
ein homogen-dimensionales Element eines Homologie-Ringes (M)_heifit 
maximal, wenn es nicht durch Multiplikation und Addition aus héherdimen- 
sionalen Elementen erzeugt werden kann. Im §5 werden die maximalen Ele- 
mente noch etwas niher betrachtet, und es werden ihnen jetzt die “minimalen”’ 
Elemente gegeniibergestellt: das sind diejenigen homogen-dimensionalen Ele- 
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mente v von R(M), welche keine Vielfachen w = u-v mit 0 < d(w) < d(v) besitzen. 
Die Untersuchung fihrt erstens leicht zu einem Satz tiber eine gewisse Dualitit 
zwischen den maximalen und den minimalen Elementen (Nr. 33) und zweitens, 
unter Benutzung des Umkehrungs-Homomorphismus, zu einer bemerkenswerten 
Invarianz-Eigenschaft der minimalen Elemente (Nr. 34). Diese Tatsachen, 
zusammen mit dem Satz Ib, liefern noch als Korollar den folgenden Satz, der 
eine kriiftige Verallgemeinerung der Tatsache darstellt, da eine Sphare gerader 
Dimension nicht als Gruppen-Mannigfaltigkeit auftreten kann: 

Satz III. In den l-Mannigfaltigkeiten sind die stetigen Bilder von Sphdéren 
gerader Dimension immer homolog 0. 

Zum Schlu® (Nr. 37) wird ein Problem formuliert, das durch die erwahnte 
Methode von Pontrjagin”’ angeregt ist und das fiir die weitere topologische 
Untersuchung der Gruppen-Mannigfaltigkeiten wichtig sein dirfte; es wird 
eine Vermutung ausgesprochen, in welcher die minimalen Elemente einer 
Gruppen-Mannigfaltigkeit eine Hauptrolle spielen. 


§1. Beweis des Satzes II. 


Der Satz II (Nr. 7) la®Bt sich in die folgenden beiden Teile zerlegen: 

Satz Ila. Fiir ungerades m ist die Sphdre S», eine T-Mannigfaltigkeit.* 

Satz IIb. Das topologische Produkt T * I’ zweter T-Mannigfaltigkeiten T 
und 1” ist selbst eine T-Mannigfaltigkett. 


10. Beweis des Saltzes Ila." Fir jeden Punkt gq der Sphire S,, bezeichne 


rz die Spiegelung der S,, an demjenigen Durchmesser, auf welchem q liegt; wir 
setzen pq = l1,(q) = r4(p). 

Die Abbildung 7, ist topologisch, also ist ¢- = +1 (und zwar, wie man leicht 
sieht, c- = —1). Wir behaupten weiter: c; = +2 (und zwar ist c,; = +2). 

p sei ein fester Punkt auf S,, ; durch l, wird jeder Grofkreis, auf dem p liegt, 
auf sich abgebildet, und zwar folgendermafen: fiihrt man auf dem Kreis eine 
Winkelkoordinate mit p als Nullpunkt ein, und ist dann gq der Punkt mit der 
Koordinate a, so hat pg = l,(q) die Koordinate 2a. Daraus ergibt sich: sowohl 
die offene Halbkugel H von S,,, deren Mittelpunkt p ist, als auch ihre anti- 
podische Halbkugel H’ wird topologisch auf S,, — p’ abgebildet, wobei p’ der 
Antipode von p ist; die gemeinsame Randsphire von H und H’ geht in den 
Punkt p’ iiber. Bezeichnen wir mit ¢g’ immer den Antipoden von gq, so ist der 
Zusammenhang swischen den Abbildungen der beiden Halbkugeln H und H’ 
durch die Beziehung 1,(q’) = 1,(q) gegeben. Nun hat bei ungeradem m die 
Involution der S,, , welche je zwei Antipoden vertauscht, den Grad + 1; daher 
haben, wenn wir die Orientierungen von H und H’ durch eine feste Orientierung 
der S,, festlegen, die topologischen Abbildungen 1, von H und H’ den gleichen 





18 Daf fiir gerades m die S,, nicht '-Mannigfaltigkeit ist, ist in Nr. 4 (a) und in Satz 
III (Nr. 9) enthalten. 

19 Wiedergabe des Beweises von ‘‘Satz IV’ aus meiner Arbeit in den Fund. Math. 25 
(1935), 427-440. 
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Grad ¢« = +1 (und zwar, wie man leicht sieht, +1). Daher hat die Abbildung 
|, der ganzen S,, auf sich den Grad 2e = +2 (und zwar +2). 


11. Beweis des Saizes IIb. Die stetigen Multiplikationen in l und I” seien 
durch 


pq = 1,(q) = rep) baw, p’q’ = 1,-(q’) = ry(p’) 


gegeben; die zugehérigen Grade seien ¢;, ¢,, cv, ¢, ; sie sind simtlich # 0. 
Wir definieren in der Mannigfaltigkeit [ x I’, deren Punkte mit p X p’, g X 
q’, --- bezeichnet werden, eine stetige Multiplikation durch die Festsetzung 


(p X p’)-(q X W') = pg X VY = Loxw (GQ X 1) = Rexa(P X D'); 
die zugeh6rigen Grade seien C,, Cz. Der Satz ist bewiesen, sobald gezeigt ist: 
Cr =cr-er, Ce = Cr-Cy. 


Die Giiltigkeit dieser Gleichheiten ist in dem folgenden Hilfssatz enthalten: 

fund f’ seien Abbildungen” der Mannigfaltigkeiten’ A und A’ in die Mannig- 
faltigkeiten B bzw. B’, welche die gleichen Dimensionen haben wie A bzw. A’; 
die Grade von f und f’ seien c baw. c’.. Dann hat die Abbildung F von A X A’ 
in B X B’, die durch 


F(p X p’) = f(p) X f'(p’) 


gegeben ist, wobei p, p’ die Punkte von A bzw. A’ durchlaufen, den Grad ce’. 

Fiir den Beweis ersetzen wir f und f’ durch so gute simpliziale Approximationen 
fi, fi, da® auch diese die Grade c, c’ haben, und da® auch die Abbildung F;, 
von A X A’ in B X B’, die durch 


Fi(p X p’) = filp) X iy’) 


gegeben ist, den gleichen Grad C hat wie F. Die Grundsimplexe der Zerlegungen 
von A, A’, B, B’, welche den simplizialen Abbildungen f, , f: zugrunde liegen, 
seien mit u;, u;, vs, v7 bezeichnet; dann bilden die Produkte u; X u; und 
v, X v; die Grundzellen von Zellenzerlegungen der Mannigfaltigkeiten A X A’ 
baw. B X B’. Durch F, wird jede Zelle u; X u; affin abgebildet, und zwar 
folgendermafen: ist 


fi(us) = 0 oder fi(u;) = 0, 


wird also die Dimension wenigstens eines der Simplexe wu; , u; durch die Abbil- 
dung f, oder f; erniedrigt, so wird auch die Dimension der Zelle u; X u; durch 
die Abbildung F; erniedrigt, es ist also Fi(u; X u;) = 0; ist 


Fi(ui) = Ur, fi(uj) = ev; , dina +1, = +1, 





2° Alle vorkommenden ‘‘Abbildungen’”’ von Mannigfaltigkeiten sollen eindeutig und 
stetig sein. 
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sind also die beiden Abbildungen f; , f; nicht-singular, so ist auch die Abbildung 
F, von u; X u; nicht-singulir, und es ist, wie sich aus bekannten Vorzeichenregeln 
bei der Bildung topologischer Produkte ergibt, 


Fi(u; X uj) = €€ (vy X v2). 


Jetzt lehrt eine leichte Abzihlung: die algebraische Bedeckungszahl—d.h. 
die Anzahl der positiven Bedeckungen, vermindert um die Anzahl der negativen 
Bedeckungen—einer festen Grundzelle von B X B’, etwa der Zelle v; X v;, also 
der Grad C von F,, ist gleich dem Produkt der algebraischen Bedeckungszahlen 
von v, und v; bei den Abbildungen f; baw. f; , also gleich cc’. 


§2. Irreduzible Erzeugenden-Systeme und maximale Elemente eines 
Homologie-Ringes. Umformung des Satzes I. 

12. Vorbemerkungen. Es sei M eine n-dimensionale Mannigfaltigkeit. Wie 
schon in Nr. 3 festgesetzt, bezeichnen wir die Dimension eines Elementes z von 
9(M) mit d(z) und verstehen unter seiner dualen Dimension die Zahl 6(z) = 
n — d(z). 


Bekanntlich ist fiir homogen-dimensionale z, z’ auchz-z’ homogen-dimensional 
und 
(2.1) 5(z-2’) = 8(z) + 6(z’)”) 
sowie 
(2.2) 2’.z = 42-2’, 
und zwar”) 
(2.3) zz = (—1)O8O 2. 2 
also speziell 


(2.4) . bei ungeradem 46(z). 


13. Erzeugenden-Systeme. Die homogen-n-dimensionalen Elemente von 
R(M), also die rationalen Vielfachen der Eins des Ringes, nennen wir 
die “‘skalaren’”’ Elemente von #(M). 

Unter einem “Erzeugenden-System” von §(M) verstehen wir ein solches 
System von homogen-dimensionalen, nicht-skalaren Elementen 2 , 22, --- , 21, 
dafi man alle erhalt, wenn man auf 2, 22, --- , 2, und 1 die Operationen der 
gegenseitigen Multiplikation, der Multiplikation mit rationalen Koeffizienten 
und der Addition ausiibt. 

Auf Grund der Regel (2.2) kann man jedes Element von #(M) auf wenigstens 
eine Weise als Polynom in den 2 , d.h. als Summe von Ausdriicken 


(2.5) t-zy'.zg?. ... -27%, a = 0, 


mit rationalen Koeffizienten ¢ schreiben. 





*t Dem Null-Element des Ringes %(M) wird jede Dimensionszahl zugeschrieben. 
#2 Man vergl. z.B. Lefschetz, Topology [New York 1930], 166. 
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Ein Erzeugenden-System heift “irreduzibel’’, wenn keines seiner echten 
Teilsysteme bereits ein Erzeugenden-System ist. 

Offenbar ist in jedem Erzeugenden-System wenigstens ein irreduzibles 
Erzeugenden-System enthalten. 

Man zeigt iibrigens leicht, daf8 die Anzahl / der Elemente eines irreduziblen 
Erzeugenden-Systems von §%(M) nicht von diesem speziellen System abhingt, 
sondern eine Invariante von M ist; wir werden diese Tatsache, die wir vorliufig 
nicht benutzen, spaiter beweisen (Nr. 31). 


14. Mazimale Elemente. Ein Element z von 8(M) hei®t ‘maximal’, wenn 
es 1) homogen-dimensional und nicht-skalar ist, und wenn es 2) nicht in dem 
Teilring von R(M) enthalten ist, der von den homogen-dimensionalen Ele- 
menten z’ von #(M) mit d(z’) > d(z) erzeugt wird.” 

Wir behaupten: Jedes Element z, eines irreduziblen Erzeugenden-Systems 
(z1, 22, +--+, 2) ist maximal. 

Beweis: Daf z, homogen-dimensional und nicht-skalar ist, ist in der Defini- 
tion des Erzeugenden-Systems enthalten. Ware z nicht maximal, so wire 
z, Element des Ringes Ul, der von allen homogen-dimensionalen Elementen z’ 
mit d(z’) > d(z,) erzeugt wird. Nun 1a®t sich aber jedes dieser z’ als Polynom 
in den Erzeugenden 2; , 22, --- , 2, schreiben, und hierbei tritt aus Dimensions- 
grinden das Element z nicht auf; aus 2 ¢ Ul wiirde daher folgen, daf auch z, 
selbst ein Polynom in den von 2, verschiedenen Elementen des Systems (2; , 
Z2, +++, 2.) Ware; dann wiirde aber dieses System, wenn man aus ihm 2 weglie£e, 
immer noch ein Erzeugenden-System bleiben—entgegen seiner Irreduzibilitits- 
Eigenschaft. 


15. Umformung des Satzes I. 
Satz Ia. (21, z2,---, 2) set ein irreduzibles Erzeugenden-System des Ringes 
R(T) einer T-Mannigfaltigkeit. Dann ist 


(2.6) Z1-Ze- +--+ -2, #0. 


Satz Ib. z sei ein maximales Element des Ringes R(T) einer T-Mannigfaltig- 
keit. Dann ist 6(z) ungerade. 

Wir werden diese beiden Sitze im §4 beweisen. Jetzt wollen wir nur zeigen, 
da aus ihnen der Satz I (Nr. 2, 3) folgt; dies wird geschehen sein, sobald wir 
bewiesen haben: 

Es sei (21, 22, --- , 21) ein irreduzibles Erzeugenden-System von R(T), und es 
set bekannt, da®B die Sdtze Ia und Ib gelten; dann haben 2 , 22, --- , 2. die in 
Nr. 3 genannten Eigenschaften. 

Zunichst ergibt sich aus Nr. 14, daf alle Elemente z; maximal, also aus Satz 
Ib, daB alle Zahlen 


(2.7) 5(z;) = m, ¢=1,2,---,l, 





* Insbesondere ist jedes homogen (n — 1)-dimensionale Element, das ~ 0 ist, maximal. 
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ungerade sind. Nach (2.3) ist daher 

(2.8) 2j°2i = —2;-2;, 
also speziell 

(2.9) 2;-2; = 0. 


Da die z; ein Erzeugenden-System bilden, kann man jedes Element von R(T) 
als Summe von Ausdriicken (2.5)—mit LZ = l—darstellen; infolge von (2.9) 
kann man sich dabei aber auf die Exponenten a, = 0 und a, = 1 beschrinken; 
das heiSt: jedes Element z von R(T) lat sich auf wenigstens eine Weise als 
lineare Verbindung mit rationalen Koeffizienten der Elemente 


(2.10) 1; 2:3 2i,-2ig (le < te); 2iy+Zig- Zig (1 < te < U3)j +++ 5 ZreZar ees oy 
darstellen, also in der Form 
(2.11) 2=t+ Do titi t+ DY bijigtiy Zig + DD bigigig2iy Zig? Zig + 

ee ee ore 


wobei die Koeffizienten ¢, ¢; , tii, , --- rational sind und die Indices die unter 
(2.10) angedeuteten Bedingungen erfiillen. Wir haben zu zeigen, da® die Ele- 
mente (2.10) eine additive Basis bilden, d.h. da® sie linear unabhangig sind (in 
Bezug auf rationale Koeffizienten), mit anderen Worten: in einer Darstellung 
(2.11) des Elementes z = 0 verschwinden alle Koeffizienten auf der rechten 
Seite. 

Ks sei also 


Q=t+ > tizi + } B bi,ig2t, *2ig > bs igig@iy *2ig* Zig 1 
(2.119) 
+ o-++ + tye. Zi-Zee + +. 


Wir multiplizieren mit 2-22. --- -2; ; auf Grund des assoziativen Gesetzes und 
der Regeln (2.8), (2.9) verschwinden auf der rechten Seite alle Produkte, in 
denen ein Index zweimal auftritt, und es entsteht daher die Gleichung 


O = t-2-Z.- +++ +273 


nach Satz Ia folgt hieraus—da der Koeffizientenbereich ein Kérper ist—: 


t= 0. 


Wir betrachten einen Index 7 und multiplizieren (2.119) mit dem Produkt aller 
der z; , fiir welche 7 ¥ 7 ist; aus analogen Griinden wie soeben entsteht: 


0 = ty-21+2Z2- eee *21, 


also folgt wie soeben: 








BO = 


SE 
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Wir betrachten zwei Indizes 2; , 72 und multiplizieren (2.11)) mit dem Produkt 
aller der z;, fiir welche 7 ¥ 0%; und 7 ¥ 7% ist; es ergibt sich 


ti, > 0. 


So fortfahrend erkennt man, daf} in der Tat alle Koeffizienten auf der rechten 
Seite von (2.119) gleich 0 sind. 

Somit bilden die Elemente (2.10) eine Basis, und die Elemente von R(T) 
lassen sich in eineindeutiger Weise mit den Ausdriicken (2.11) identifizieren. 
Da& fiir die Multiplikation die antikommutative Regel (2.8) gilt, wurde schon 
gezeigt. Fir den vollstaéndigen Beweis des Satzes I fehlt nur noch die Besti- 
tigung der in Nr. 3 angegebenen Dimensions-Regeln. 

Aus dem Satz Ia folgt, daf& die Dimensionszahl d(z;:z2- --- -z;) = dy wohl- 
bestimmt und 2 0 ist; fiir 7 << % < --- < 7, ist d(z;,-2:,- --- -2:,) 2 dy, also 
ist infolge von (2.11) auch d(z) 2 dp fiir jedes Element z von (I); da es 0-dimen- 
sionale Elemente z gibt, ist daher do = 0. Dies ist, wenn I die Dimension n 
hat, gleichbedeutend mit 6(z;-z2- --- -z;) = n; wenn wir 6(z;) = m; setzen, ist 
daher nach (2.1) 


n =m + Mm +--- +m. 
Allgemein ergibt sich fiir 7 < % < --- < 7%,, wieder nach (2.1), 
Mig -Bi,: «++ +g) = 8 — Mente: --- -%%) 
=n— mM, — M, — --- — M,. 


Daf schlieBlich alle m; ungerade sind, wurde schon durch (2.7) festgestellt. 
Damit ist der Satz I vollstaéndig bewiesen—unter der Voraussetzung, daf die 
Sitze Ia und Ib gelten, die im §4 bewiesen werden sollen. 


16. Hilfssdtze.* Wir stellen hier noch einige einfache Tatsachen zusammen, 
die wir spéiter (§4) brauchen werden. 

M sei eine n-dimensionale Mannigfaltigkeit und (2: , z2 , - - - , 2:) ein beliebiges 
Erzeugenden-System von #(M). Unter Ul verstehen wir die Menge aller der- 
jenigen Elemente, die sich in der Gestalt 


We-Z. + W3-2%3 +--+ + Wiz, 


schreiben lassen, wobei die w; beliebige Elemente von #(M) sind. 

Wir behaupten: 

(a) Jedes homogen-dimensionale Element z mit d(z:) < d(z) < n gehért zu U. 

(b) Ist 2: in U enthalten, so ist das Erzeugenden-System (2, 22, +++ , 21) 
reduztbel. 

Beweis von (a): Man schreibe z als lineare Verbindung von Potenzprodukten 
zi'-2g*. ... -zf' (mit rationalen Koeffizienten); da z und alle z; homogen- 





** Die Hilfssitze der Nummern 16-18 und 20-22, die an und fiir sich kaum Interesse 
verdienen, werden erst in den Nummern 28 und 29 angewandt. 
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dimensional sind, kann man sich dabei offenbar auf solche Potenzprodukte 
beschrinken, die selbst die Dimension d(z) haben; fiir ein solches Potenzprodukt 
ist dann nach (2.1) 


a, -6(21) + az-5(Z2) +--+ + a+ 6(21) = 6(z); 


da 6(z) < 6(z) ist, folgt hieraus a; = 0, und da 6(z) > 0 ist, folgt weiter, da8 
nicht a. = --- = a, = O ist; dies bedeutet: jedes Potenzprodukt enthalt wenig- 
stens eines der Elemente 22, --- , 2, als Faktor, d.h. z gehért zu U. 

Beweis von (b): Es sei. z; € U, also 


l 

(2.12) a= Do wy-253 

ta 
dabei kann man, da die z; homogen-dimensional sind, auch die Elemente w; als 
homogen-dimensional und d(w;-z;) = d(z), also 6(w;) + 6(z;) = 6(a), anneh- 
men. Da die 6(z;) > 0 sind (Nr. 13), sind daher alle 6(w;) < 6(z). Hieraus 
folgt, analog wie im Beweis von (a): stellt man w; als lineare Verbindung von 
Potenzprodukten zj'-z2?- --- -z7' dar, so ist immer a; = 0; jedes w; ist also 
durch z,---,2, allein auszudriicken, und nach (2.12) ist dann auch % 
durch z,,--- , 2, auszudriicken. Aber dann erzeugen bereits 22, --- , 2: den 
Ring R(M). 


17. Es sei z ein homogen-dimensionales Element von #(M) und d(z) < n. 
Unter ‘% verstehen wir die Menge aller derjenigen Elemente, die sich als Sum- 


men von Produkten w-v schreiben lassen, wobei die Elemente v homogen- 
dimensional mit 


(2.13) d(v) ¥ d(z), div) <n 


und die Elemente w beliebig sind. 
Wir behaupten: 
(ec) Ist z in B enthalten, so ist z nicht maximal. 
Beweis: Es sei z in ¥ enthalten, also 


(2.14) z= DS Wr: | 


hierin sind die v, homogen-dimensional und erfiillen (2.13); da z homogen- 
dimensional ist, diirfen wir auch die w;, als homogen-dimensional und d(w,-v,) = 
d(z), also 6(wa) + 6(v,) = 4(z), fiir alle h annehmen. Dann ist 6(v,) S 6(2), 
also, da nach (2.13) 5(v,) ¥ 6(z) ist, 5(un) < 6(z), d(v,) > d(z). Da nach (2.13) 
andererseits 5(v,) > 0 ist, ist auch 6(w,) < 6(z), d(w,) > d(z). Aus d(v,) > d(z) 
und d(w,) > d(z) folgt nach (2.14), da® z nicht maximal ist. 


18. Bemerkung wiber Ideale in R(M). Da die Multiplikation in dem Ring 
R(M) im allgemeinen nicht kommutativ ist, hat man unter den Idealen des 
Ringes zwischen Links-, Rechts- und zweiseitigen Idealen zu unterscheiden. 
Jedoch gilt folgender Satz: 








oT Ae 
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Es seien 21, 22, --+ , 2m homogen-dimensionale Elemente von 8(M) und ¥ 
das von ihnen erzeugte Links-Ideal, d.h. die Menge aller Elemente der Gestalt 


(2.15) Wr Ly + Wee Te + +--+ + Wm Tm 


mit willkiirlichen Elementen w; ; dann ist X zugleich Rechts-Ideal, also zweiseitig. 

Zum Beweis ziehen wir eine volle additive Basis (Z; , Z2 , --- , Z,) von #(M) 
heran, deren Elemente Z, wir als homogen-dimensional annehmen diirfen. 
Dann ist die Menge % aller Elemente (2.15) identisch mit der Menge aller 
linearen Verbindungen der Produkte Z,-x; mit rationalen Koeffizienten (h = 
1, 2,---,q; 7 = 1, 2,---,m). Nach (2.2) ist aber, da die Z, und die 2; 
homogen-dimensional sind, Z,-7; = =a;-Z, , und daher ist ¥ auch die Menge 
aller linearen Verbindungen der Produkte 2;-Z, , also die Menge aller Elemente 


X1-Wy + Xo-We + +--+ + Lm:Wm 


mit willkiirlichen w; ; diese Menge ist aber das von 2, 22, --- , 2m erzeugte 
Rechts-Ideal. 

Auf Grund dieses Satzes sind insbesondere die Mengen U und &, die wir in 
Nr. 16 und 17 betrachtet haben, zweisettige Ideale. 


§3. Eigenschaften des Ringes #(M x M). 


In Nr. 8 wurde die Rolle angedeutet, welche die Produkt-Mannigfaltigkeit 
I X I beim Beweise des Satzes I spielt. Der gegenwartige Paragraph enthalt 
lediglich eine, auf diesen Zweck zugeschnittene, Zusammenstellung und Formu- 
lierung bekannter Eigenschaften des Ringes #(M xX M); dabei bezeichnet M 
eine beliebige Mannigfaltigkeit. Der Koeffizientenbereich ist wie immer der 
Korper der rationalen Zahlen.” 


19. Zu je zwei Elementen z, y von #(M) gehért ein Element x X y von 
R(M X M). Diese Produktbildung ist mit der Addition distributiv verkniipft. 
Sind x und y homogen-dimensional, so ist auch x X y homogen-dimensional 
und d(x X y) = d(x) + d(y). 

Umgekehrt la®t sich jedes Element von R(M X M) als >> (a, X ys) dar- 
stellen; genauer: ist das System (Z;, Zz, --- , Z,) eine volle additive Basis von 
R(M)—d.h. die Vereinigung von Homologiebasen aller Dimensionen—, so 
bilden die Z; X Z, eine volle additive Basis von R(M xX M); die Elemente von 
N(M X M) sind also in eineindeutiger Weise als }> t(Z; X Z,) mit rationalen 
tj, auszudriicken. Damit sind die additiven Eigenschaften von #(M xX M) 
gegeben. 





*° Wegen der additiven Eigenschaften von R(M X M) vergl. man z.B. Alexandroff-Hopf, 
a.a.0.6, Kap. VII, §3; jedoch hat man die dortigen Betrachtungen dadurch abzuindern 
(und wesentlich zu vereinfachen), da8 man rationale Koeffizienten verwendet; wegen der 
multiplikativen Eigenschaften, insbesondere unserer Formel (3.1), vergl. man Lefschetz, 
a.a.0.22, Chapter V, §3, insbesondere Formel (21).—Ubrigens darf in unserem ganzen §3 
der Koeffizientenbereich ein beliebiger Kérper sein. 
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Die Multiplikation ist nun durch die Regel 
(3.1) (x X y)-(2" X y’) = (1-2! K y-y’), 


welche fiir homogen-dimensionale x, y’ gilt, und durch die distributiven Gesetze 
vollstaindig bestimmt. 


20. Hilfssdtze.* % sei eine additive Untergruppe von #(M), die rational 
abgeschlossen ist, d.h.: aus x eX folgt tx ¢ X fiir jede rationale Zahl ¢. Ferner 
seien z, y, und zwar y # 0, zwei solche Elemente von §#t(M), daf& sich das 
Element y X x von R(M X M) in der Form 


(3.2) yXr= > (yn X ata) 


darstellen laBt, wobei die x, Elemente von X sind, wihrend von den Elementen 
yn nichts vorausgesetzt wird. 
Behauptung: Dann ist x « X. 
Beweis: (Z;, Ze, --- , Z,) sei eine volle additive Basis in R(M). Es sei 
z= adi, y = 2 0;Z;, 
a 


7 
Th » Cri Zi ; te x dy; Z; 
a 2 


mit rationalen a, b, c, d; dann folgt aus (3.2), da® 


byai = Dy dren, 
h 
also 


(3.3) bj = DY darn 
h 


fiir jedes j ist. Da nach Voraussetzung y ¥ 0 ist, ist wenigstens ein b; # 0; 
es sei etwa b} + 0. Dann folgt aus (3.3) 


a TE | 
a by 


21. X sei ein (zweiseitiges) Ideal in #(M), welches von homogen-dimen- 
sionalen Elementen 2, 22, --- , %» erzeugt wird (man vergl. Nr. 18). Unter 
X* verstehen wir die Menge aller derjenigen Elemente von R(M xX M), welche 
sich in der Gestalt 


/ 
(3.4) DX (yn X 21) 
h 
schreiben lassen, wobei die x, Elemente von %, die y, beliebige Elemente von 


R(M) sind. 
Behauptung: X* ist ein zweiseitiges Ideal in R(M X M). 
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Beweis: Es sei wieder (Z;, Z2, --- , Z,) eine volle additive Basis in #(M); 
dann ist ¥ identisch mit der Menge aller linearen Verbindungen der Produkte 
Z;-x; (mit rationalen Koeffizienten) und X* daher identisch mit der Menge 
aller linearen Verbindungen der Elemente Z, X Z;-2; (j, k = 1, 2,---,q; 
i = 1,2,---,m). Da wir die Z; als homogen-dimensional annehmen diirfen, ist 
aber nach (3.1) 

Zi. x Z 5X3 = +(Z; x Z;)-(1 x Xi); 


die Z, X Z; bilden eine volle additive Basis; folglich ist ¥* die Menge aller 
Summen 


ie W;-(1 X 2), 


wobei W; willkiirliche Elemente von (MM xX M) sind; dies bedeutet aber: 
%* ist das Links-Ideal, das von den Elementen 1 X 2; ,7 = 1, 2, --- , m, erzeugt 
wird. Da die x; homogen-dimensional sind, sind auch die Elemente 1 X 2; 
homogen-dimensional, und nach Nr. 18 ist ¥* daher ein zweiseitiges Ideal. 


22. Wir bringen den hiermit bewiesenen Satz in Verbindung mit dem Satz 
aus Nr. 20. Ein Ideal ¥ hat die in Nr. 20 genannte Eigenschaft der rationalen 
Abgeschlossenheit, da man ja die rationalen Zahlen als die skalaren Elemente- 
d.h. die rationalen Vielfachen des Eins-Elementes—von 9(M) auffassen kann. 
Daher ergibt sich aus Nr. 20 und 21 das folgende Lemma: 

Die Ideale X und X* sollen wie in Nr. 21 erkldrt sein; ferner seten x und y Ele- 
ment von R(M), fiir welche 

y #0 


yXz2z0 mod X* 


gilt; dann ist 
mod &. 


os 
Il 
an) 


23. Die Homomorphismen A und P. M sei eine n-dimensionale Mannig- 
faltigkeit und (Z,, Zz, --- , Z,) eine volle additive Basis von R(M); wir diirfen 
annehmen, da alle Z; homogen-dimensional sind, dai Z; = 1 und d(Z;) < n 
fir7 > list. Dadie Z; X Z, eine Basis von #(M X M) bilden, besitzt jedes 
Klement Q von 8(M xX M) eine und nur eine Darstellung 


(3.4) Q = 2 tx(Z; x Zi) 
mit rationalen ¢;,. Wir setzen 

(3.5) A(Q) = do eZ 
Zum Beispiel ist fiir jedes Element y von #(M) 
(3.6) AA xXy=y 
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und, falls d(x) < n ist, 
(3.7) A(x X y) = 0; 
man bestitigt (3.6) und (3.7) einfach, indem man y, baw. x und y, als lineare 
Verbindungen der Z, schreibt. 

Wir fassen A als Abbildung des Ringes #(M xX M) in den Ring R(M) auf 
und behaupten: A ist eine Homomorphie. 

Dai A additiv homomorph, d.h. dai 


A(Qi + Qe) = A(Qi) + A(Qz) 
ist, liest man unmittelbar aus (3.4) und (3.5) ab. Fiir den Beweis der multi- 
plikativen Homomorphie, d.h. der Giiltigkeit der Gleichheit 
(3.8) A(Qi-Qe) = A(Q1)- A(Q2) 


darf man sich infolge der Distributivitaét und der schon konstatierten additiven 
Homomorphie auf den Fall beschrinken, da Q; , Q@2 Elemente der Basis Z; X Z; 
von R(M X M) sind. Es sei also 
Qi = Zi X Zi, Qe =2Z;X Z. 
Ist h = 7 = 1, so ist einerseits 
A(Qi) = Zi, = A(Qz) = Ze, 

andererseits nach (3.1) 

Q1-Qe = 1X Zi-Zx, 
also nach (3.6) 

A(Qi-Qe) = Zi-Ze ; 
folglich gilt (3.8). Ist nicht h = 7 = 1, so ist wenigstens eines der Elemente 
A(Q;), A(Qe2) gleich Null, also ist die rechte Seite von (3.8) Null; andererseits 
ist nach (3.1) 


Qi1-Q2 = +Z,-Z2; X Zi-Z;., 
und hierin ist d(Z;,-Z;) < n; daher ist nach (3.7) auch die linke Seite A(Q:-Qs) 
von (3.8) gleich Null.—Damit ist die Homomorphie-Eigenschaft von A bewiesen. 
Setzt man im Anschlu® an (3.4) 


(3.9) P(Q) = pm tn Z;, 


so ergibt sich ganz analog: P ist eine homomorphe Abbildung von R(M Xx M) 
in R(M). 


24. Aus (3.4), (3.5), (3.9) liest man ab, da fiir jedes Element Q von 
R(M X M) Gleichungen 


Q = (1 X AQ)) + ke (Z; X Y;), d(Z;) <n, 
(3.10) ; 


Q = (P(Q) X 1) + X (XiX Zi), d(Z) <n 


gelten. 
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Ferner ist klar: ist Q homogen (n + r)-dimensional, so sind A(Q) und P(Q) 
homogen r-dimensional. 

Es sei jetzt Q homogen (n + r)-dimensional und r < n. Dann ist in (3.4) 
ty, = 0, und daher erhalt (3.4) mit Hilfe von (3.5) und (3.9) die Gestalt 


Bl) Q@=0xXAQ@) +(P@ x1) + rs D talZ; X Zi). 


Da die Z; homogen-dimensional sind, sind auch die Z; K Z;, homogen-dimen- 
sional, und es ist d(Z; X Z%) = d(Z;) + d(Z;); daher sind in (3.11) nur solche 
t;, * 0, fir welche 

d(Z;) + d(Z,) =n+r 
ist; ferner ist in (3.11), daz > 0, k > O ist, immer 

d(Z;) <n, d(Z.) <n 
und folglich 

d(Z,.) > 1, d(Z;) > rr. 
Mithin la@t sich (3.11) auch so ausdriicken: 

Q= (1 x AQ) + (PQ XI) +2 xX w), 


(3.12) <n, yx homogen-dimensional mit 


r < d(x.) <n, r<d(yr,) <n. 


Wir fassen zusammen: Es gibt zwei solche Homomorphismen A und P des 
Ringes R(M X M) in den Ring R(M), dab jedes Element Q von R(M XK M) 
Darstellungen (3.10) und daB jedes homogen (n + r)-dimensionale Element Q 
mit r < n eine Darstellung (3.12) besitzt. 

Es ist tibrigens leicht zu sehen, da die Abbildungen A und P, die wir durch 
(3.5) und (3.9) unter Benutzung einer speziellen Basis {Z;} erklirt haben, von 
der Wahl dieser Basis unabhingig sind. 


§4. Beweis des Satzes I. 


25. Der Umkehrungs-Homomorphismus. M und M’ seien beliebige Mannig- 
faltigkeiten,” und F sei eine Abbildung von M in M’. Dann bewirkt F eine 
Abbildung des Ringes 3t(M) in den Ring #(M’); wir bezeichnen auch diese 
Ring-Abbildung mit F; sie ist iibrigens additiv homomorph, jedoch im allge- 
meinen nicht multiplikativ homomorph. 

Es gilt der Satz: 


°° H. Hopf, Zur Algebra der Abbildungen von Mannigfaltigkeiten (Journ. f.d.r.u.a. Math. 
163 (1930), 71-88].—Neue Begriindung und Verallgemeinerung des Umkehrungs-Homo- 
morphismus: H. Freudenthal, Zum Hopfschen Umkehrhomomorphismus [Annals of Math. 
(2) 38 (1937), 847-853]; ferner: A. Komatu, Uber die Ringdualitat eines Kompaktums [Téhoku 
Math. Journ. 43 (1937), 414-420]; H. Whitney, On products in a complex [Annals of Math. 
(2) 39 (1938), 397-432], (Theorem 6).—Die Eigenschaft 3 unseres Textes ist in meiner 
zitierten Arbeit nicht hervorgehoben, da dort nur gleichdimensionale Mannigfaltigkeiten 
betrachtet werden; sie ergibt sich aber unmittelbar aus jeder einzelnen der verschiedenen, 
in den soeben genannten Arbeiten enthaltenen, Definitionen von @; iiberdies ist sie eine 
Folge der Eigenschaft 2; hierzu vergl. man Nr. 11 meiner Arbeit ‘‘Hin topologischer Beitrag 
cur reellen Algebra’ [Com. Math. Helvet.; erscheint nichstens]. 
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Es existiert cine Abbildung ® des Ringes R(M’) in den Ring R(M) mit den 
folgenden drei Eigenschaften: 

1) & ist ein additiver und multiplikativer Homomorphismus ; 

2) & ist mit F durch die Funktionalgleichung 


(4.1) F((z)-2) = z-F(z) 


verkniipft, in welcher x cin beliebiges Element von R(M) und z ein beliebiges Ele- 
ment von R(M") ist; 

3) ist z homogen-dimensional, so ist auch &(z) homogen-dimensional, und zwar 
ist 6(@(z)) = 6(z), also 


d(®(z)) = d(z) + d(M) — d(M’). 


® heift der ‘‘Umkehrungs-Homomorphismus”’ von F. 


26. Ansatz zum Beweis der Sdtze la und Ib. In der n-dimensionalen Mannig- 
faltigkeit I’ sei eine stetige Multiplikation gegeben (Nr. 1). Da wir die Punkte- 
paare (p, g) von I als die Punkte p X q der Produkt-Mannigfaltigkeit [ X T 
deuten kénnen, ist die stetige Multiplikation gleichbedeutend mit einer Ab- 
bildung F vonT X T in T; diese ist durch F(p X q) = pq bestimmt. 

Wie in Nr. 25 bezeichnen wir auch die durch F bewirkte Abbildung des Ringes 
R(T X T) in den Ring R(T) mit F. Das Element von (I), das durch einen 
einfach gezihlten Punkt reprasentiert wird, heiSie p. Das mit einer rationalen 
Zahl c multiplizierte Eins-Element von (1) bezeichnen wir kurz mit c. Dann 
sind die Grade c; und ¢, , die in Nr. 1 erklart worden sind, offenbar durch 


(4.2) FipX1l) =a, FAXp)=c6 


charakterisiert. 
® sei der Umkehrungs-Homomorphismus von F. Dann folgt aus (4.1) und 
(4.2) 


(4.3a) F(®(z)-(p X 1)) = cz, 
(4.3b) F(®(z)-(1 X p)) = cz 


fiir jedes Element z von (I). 
Die Homomorphismen A und P sind in Nr. 23, 24 erklart worden; wir setzen 


A(z) = A(z), — P®(z) = plz) 


fiir jedes Element z von (1); dann sind \ und p Homomorphismen des Ringes 
R(T) tn sich, und nach Nr. 24, (3.10), gelten fiir jedes Element z von (I) 
Gleichungen 


(4.4a) &(z2) = (1X A@)) + DZ x Y), d(Z) <n, 


(4.4b) ®(z) = (oz) X I + X (Xi X Zi), (Zi) <n. 
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Nach Nr. 25, 3), ordnet 6, da d(T! X T) = 2n, d([) = n ist, jedem homogen 
r-dimensionalen Element von R(T) ein homogen (n + r)-dimensionales Element 
von R(T X T) zu; nach Nr. 24 ordnen A und P den homogen (n + r)-dimen- 
sionalen Elementen von (I X I’) homogen r-dimensionale Elemente von R(T) 
zu; folglich sind \ und p dimensionstreu. 


27. Jetzt-sei 
(4.5) c, ~ 0, c, ¥ 0, 


also T eine T-Mannigfaltigkeit. Wir behaupten: dann sind \ und p Auto- 
morphismen™ von R(L). 
Beweis: Ist A(z) = 0, so ist nach (4.4a) 


®(z) = >> (Z; X Y,) mit d(Z;) < n; 
aus d(Z;) < n folgt Z;-p = 0; folglich ist, nach (3.1), 


#(z)-(p X 1) = 0, 
also nach (4.3a) 
Cz = 0, 


also nach (4.5), da der Koeffizientenbereich ein Kérper ist, 
z=0. 


Das bedeutet: der Homomorphismus ist eineindeutig. Folglich ist die De- 
terminante der Substitution, welche durch ) auf eine volle additive Basis von 
R(T) ausgetibt wird, nicht Null. Da der Koeffizientenbereich ein Kérper ist, 
folgt hieraus: \ ist eine Abbildung von R(T) auf sich. Somit ist \ ein 
Automorphismus.—Analog beweist man die Behauptung fiir p. 

Wir fassen zusammen, indem wir noch das Ergebnis von Nr. 24 heranziehen: 

Fir jede n-dimensionale T-Mannigfaltigkeit T sind zwei dimensionstreue Auto- 
morphismen \ und p von R(T) und ein Homomorphismus ® von R(T) in R(T X T) 
ausgezetchnet; ist z ein homogen r-dimensionales Element von R(T) und r < n, 
so gilt 

@(z) = (1 X A) + (Oo) X 1) + D (er X wr), 
(4.6) Xn, Yr homogen-dimensional mit 
r < d(x) < n, r<d(y) <n. 


In diesem Satz sind alle Eigenschaften der r-Mannigfaltigkeiten enthalten, 
die wir im Folgenden benutzen werden. 


28. Beweis des Satzes Ia (Nr. 15). Es sei (21, 22, --- , 21) ein irreduzibles 
Erzeugenden-System fiir die n-dimensionale I'-Mannigfaltigkeit !. Wir werden 





27 Unter einem ‘‘Automorphismus”’ eines Ringes ® ist ein Auto-7somorphismus von R 
auf sich zu verstehen. 
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durch vollstiindige Induktion nach k beweisen: das Produkt von je k voneinander 
verschiedenen Elementen dieses Systems ist # 0. Fiir k = 1 ist dies richtig, 
da ein irreduzibles Erzeugenden-System niemals die Null enthalten kann. Es 
sei fiir k — 1 bewiesen, und k Elemente des Systems, etwa 21, 22, --- , 2, Selen 
vorgelegt; da ihre Anordnung lediglich das Vorzeichen ihres Produktes bein- 
flussen kann, diirfen wir annehmen, dai . 


(4.7) d(a1) > d(z;), j ms 1, 2, shee »k 
ist; nach Induktions-Annahme ist 
Zq+2g- +++ Sp 0, 


also, da p ein Automorphismus ist, auch 
(4.8) p(Ze-Z3- --- -Z) #0. 


re , aie . . . . 
Wir setzen d(z;) = 2; fir i = 1, 2,---,1. Da d ein Automorphismus ist, 
. , , - . . . 
bilden auch 2; , 22, --- , 2, ein irreduzibles Erzeugenden-System. Da \ dimen- 
sionstreu ist, ist 


(4.9) d(z;) = d(zi), 
und nach (4.7) 
(4.7’) d(z;) < d(z}), 


Wir verstehen nun wie in Nr. 16—mit dem Unterschied, daf wir die dortigen 
z; durch unsere neuen 2; ersetzen—unter Ul das Ideal in R(P), das von zg, «++ ,21 
erzeugt wird (man vergl. Nr. 18); ferner verstehen wir unter 1* das, analog wie 
in Nr. 21 erklirte, zu U gehérige Ideal in R(T X T), also die Menge derjenigen 
Elemente von (I X IT), die sich in der Gestalt >» (w, X up) mit une Ul 
schreiben lassen. 

Schreiben wir fiir 7 = 1, 2, ,k das Element (z;) in der Form (4.6): 


(4.6,) ®(z;) = (1 X 2) + (o(2;)) X I + Y (ea X wr); 


so lautet auf Grund von (4.9) und (4.7’) die eine der Dimensions-Bedingungen 
aus (4.6) 


d(z;) S d(z;) < d(y,) <n; 


folglich gehéren nach Nr. 16 (a) alle in den Gleichungen (4.6;) auftretenden ys 
zu und daher die Summe >> (a, X yx) zu U*; fiir 1 <j < k gehdrt au®erdem 
z; zu U, also l X z; zu U*. Daher sind in den Gleichungen (4.6 ;) die folgenden 
Kongruenzen modulo 11* enthalten: 


®(21) = (1 X 21) + (p(a) X 1), 


P(z;) = (p(z;) X 1), 
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Da U* ein zweiseitiges Ideal ist, diirfen wir diese Kongruenzen miteinander 
multiplizieren, und da ® und p Homomorphismen sind, ergibt sich dabei, bei 
Beachtung von (3.1): 


@(z1-22- +++ Zu) = (o(ee- --- +24) X 21) +. (p(Zr-ze- --- +n) X 1). 
Ware nun 2-2: --- -2 = 0, so wiirde hieraus 
p(Z- +++ +2) X 2; = 0 mod ll* 


folgen; nach (4.8) und dem Lemma von Nr. 22 wire dann 
zi = 0 mod U, 


also z; in U enthalten; nach Nr. 16 (b) ist dies mit der Irreduzibilitat des 
Erzeugenden-Systems (21, 22, +++, 22) nicht vertraglich. Folglich ist 


Z1-Ze: +++ -Be 0, 
> 28 
was zu beweisen war. 


29. Beweis des Saizes Ib (Nr. 15). Im Ring der n-dimensionalen I'-Mannig- 
faltigkeit I sei z ein homogen-dimensionales Element, fiir welches d(z) < n und 
6(z) gerade ist; wir haben zu zeigen, dai dann z nicht maximal ist. 

Das Ideal % sei wGrtlich wie in Nr. 17 definiert ; B* sei das Ideal in R(T X TL), 
das analog wie in Nr. 21 durch % bestimmt ist: es besteht aus allen Elementen 
} (wi X Un) mit v, « B. 

In der Gleichung (4.6) fiir unser Element z gehéren alle Elemente y, zu B 
und somit gehért die Summe tox (tr X ya) zu B*. Es gilt also die Kongruenz 


(4.10) @(z) = (1 XK A(z)) + (e(z) X 1) mod ¥*. 
Nach (8.1) ist 

(4.11) (1 X A(z))-(o(z) X 1) = pfz) X ACZ) 

und, da 6(z) gerade ist, auch 

(4.11’) (p(z) X 1)-(1 X A(z)) = plz) X ACZ). 


Aus (4.11) und (4.11’) ergibt sich, daf man die rechte Seite von (4.10) nach 
der binomischen Formel potenzieren kann; tut man dies und beachtet man, 
da #, p, \ Homomorphismen sind, so erhalt man fiir jeden positiven Exponen- 
ten m: 


™m 


(4.12) &(2”") = >> ("") (p(z” *) X A(z’) mod ¥*. 


Nun ist, da d(z) < nist, fir r > 1 immer d(z’) < d(z), also, da \ dimensionstreu 
ist, auch d(A(z’)) < d(z) und daher A(z’) « B und 


p(z” ") X A(z’) « B* fir r > 1; 








*8 Der Beweis zeigt, daf fiir den Satz Ia der Koeffizientenbereich ein beliebiger Kérper 
sein darf. 
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daher reduziert sich die Kongruenz (4.12) auf 
(4.13) (2”) = m(p(z”) X A(z)) + (p(2”) XK 1) mod B*. 


Dies gilt fiir jeden positiven Exponenten m. Da aus Dimensions-Griinden 
z” = 0 fiir hinreichend grofe m ist, kann man m so wihlen, da 


27" #0, 2 =0 
ist. Dann entsteht aus (4.13) die Kongruenz 
m(p(z” ") X A(z)) = O mod &*, 
also, da der Koeffizientenbereich der Kérper der rationalen Zahlen ist,” 
(4.14) p(z”") X A(z) = O mod B*. 


Daz” ¥ 0 und p ein Automorphismus ist, ist auch p(z”') # 0; nach dem 

Lemma in Nr. 22 folgt daher aus (4.14) 
\(z) = 0 mod &, 

d.h. A(z) « B. Der Automorphismus X ist dimensionstreu, und die Menge ¥ 
ist durch Dimensions-Eigenschaften definiert; daher geht ¥ bei Ausiibung von 
\" in sich tiber, und mit A(z) ist daher auch z in B enthalten. Nach dem 
Hilfssatz Nr. 17 (c) ist mithin z nicht maximal—was zu beweisen war. 
Mit den Satzen Ia und Ib ist der Satz I vollstaéndig bewiesen (Nr. 15). 


§5. Maximale und minimale Elemente. Beweis des Satzes III 


30. Die Range 1l,. Wir kniipfen an Nr. 13 und 14 an. Es sei wieder M eine 
beliebige n-dimensionale Mannigfaltigkeit. Die additive Gruppe der homogen 
r-dimensionalen Elemente von #(M), also die r-te Bettische Gruppe (in bezug 
auf rationale Koeffizienten) heige %, ; ihr Rang p, ist die r-te Bettische Zahl 
von M. 

Fir 0 S r < n verstehen wir unter U, die Gruppe derjenigen Elemente von 
%,, welche sich aus den Elementen der Gruppe 


Bn, + Bn + sh + B41 


durch Multiplikation und Addition erzeugen lassen, mit anderen Worten: welche 
nicht maximal sind. U, lat sich auch als die Gesamtheit derjenigen Elemente 
charakterisieren, die sich in der Form ). 2;-y; schreiben lassen, wobei x; , yi 


2 


homogen-dimensional mit 


iti) + (yi) =n—7r, OK< dx), 0 < Hy) 
sind. 
Den Rang von U, nennen wir g, , und wir setzen 


Powe ~ Ga. = ,, s=1,2,-.--,n. 








29 Dies ist die einzige Stelle in der ganzen Arbeit, an welcher benutzt wird, da8 der 
Koeffizientenkérper die Charakteristik 0 hat. 
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Die Zahl 1, ist der Rang der Restklassengruppe 8,_, — U,—. und hat daher die 
folgende Bedeutung: es gibt ein System von /, derartigen maximalen Elementen 
Yi, Ye,» Yt, Mit 5(y;) = s, da®B auch jede lineare Verbindung 


UYi + Uyo + --- + UL,Yi, 


mit rationalen Koeffizienten u;, abgesehen von derjenigen mit u) = --- = 
um, = 0, selbst maximal ist; dagegen gibt es fiir kein /’ > 1, ein System von I’ 
derartigen Elementen. 


31. Wir behaupten erstens: In einem Erzeugenden-System (2), --- , 2,) von 
R(M) ist die Anzahl der z; mit 6(z;) = s stets = lI, . 

Beweis: Es seien y;, --- , yz, maximale Elemente mit der soeben genannten 
Eigenschaft. Stellt man die y; als Polynome in den Erzeugenden 2, --- , 2, 
dar, so haben diese Darstellungen aus Dimensions-Griinden die Gestalt 


Yi = lati +--+ + bime@m + Yi; 
dabei sind 2, --- , 2m diejenigen Erzeugenden z;, fiir welche 6(z;) = s ist 
(m = 0), t:; rationale Zahlen und Y; Polynome in den z, mit d(z,) > n — s. 
Wire m < l, , so besife das Gleichungssystem 

ls 

D uti = 0, j =1,-++,m, 

t=1 
eine rationale Lésung (%, --- , u:,) ¥ (0, --- , 0), und es wire 

Wyi t--- + U,Y1, = mY; +---+ ui,Yi, ; 
dieses Element ware, wie die rechte Seite zeigt, nicht maximal—entgegen der 
Voraussetzung iiber die y; . 
Wir behaupten zweitens: Ist—bei Benutzung derselben Bezeichnungen wie 


soeben—m > l, , so ist das Erzeugenden-System (2, , --- , Zz) reduzibel. 
Beweis: Dal’ = m > l, ist, kann man, wie am Schlu& von Nr. 30 festgestellt 
wurde, Zahlen uw, --- , Um SO bestimmen, da das Element 


Z = U2) + -+> + UmZm 


nicht maximal ist, und daf nicht alle vu; = 0 sind; dann ist Z ein Polynom in 
den z, mit d(z.) > n — s, also erst recht in den z; mit 7 > m, und man kann, 
wenn etwa u, ~ 0 ist, z, durch die z; mit 7 > 1 ausdriicken und somit das 
Erzeugenden-System reduzieren. 

Damit haben wir festgestellt: In jedem irreduziblen Erzeugenden-System 
(21, +--+, 22) ist die Anzahl derjenigen 2; , fiir welche 5(z:) = s ist, gleich l, ; die 
Anzahl aller Erzeugenden eines irreduziblen Systems ist daher immer 


Links +h,” 


*° Hierdurch ist jeder Mannigfaltigkeit M eine Invariante | zugeordnet, welche etwa der 
“Rang” von M heifBen mége. Fiir eine r-Mannigfaltigkeit ist, wie sich aus dem Satz (b) 
in Nr. 4 und seiner Herleitung ergibt, die Summe der Bettischen Zahlen gleich 2'; anderer- 
seits ist fiir eine Gruppen-Mannigfaltigkeit, wie Cartan mit Hilfe der Integral-Invarianten 
berechnet hat’, (b), 24, der hier auftretende Exponent gleich dem ‘‘Rang’’ der Gruppe, 
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32. Die minimalen Elemente. Es sei v ein homogen-dimensionales Element 
von #(M) und d(v) > 0. Wir betrachten seine Vielfachen, d.h. die Elemente 
x-v, wobei x die Elemente von #(M) durchlauft; unter diesen Vielfachen sind 
au®Ber dem Null-Element erstens alle rationalen Vielfachen von v enthalten— 
das sind Elemente der Dimension d(v)—und zweitens, falls nur v ¥ 0 ist, nach 
dem Poincaré-Veblenschen Dualitits-Satz alle Elemente der Dimension 0; fir 
alle Vielfachen ist d(z-v) < d(v). Wir definieren nun: das Element v heift 
“minimal’’, wenn es keine Vielfachen «-v besitzt, fiir welche 


xv £0, 0 < d(x-v) < d(v) 


ist. Diese Definition ist offenbar gleichwertig mit der folgenden: v ist minimal, 
wenn fiir alle homogen-dimensionalen 2, fiir welche 


0 < d(x) < d(v), alson — d(v) < d(x) < n, 


ist, die Produkte x-v = 0 sind. 

Dabei ist, wie oben gesagt, d(v) > 0 vorausgesetzt; diese Verabredung ist 
analog der frither getroffenen, die n-dimensionalen Elemente nicht als maximal 
zu bezeichnen. Dagegen ist das Element 0 des Ringes (M) minimal.” 


33. Ein Dualitdtssatz. Die Gruppen U, sind in Nr. 30 erklairt worden. Wir 
behaupten : 

Das homogen s-dimensionale Element v ist dann und nur dann minimal, wenn 
es ein Annullator der Gruppe Un-s ist, d.h. wenn u-v = 0 fiir gedes Element u aus 
U,—. gilt (¢ = 1,--- ,n). 


Beweis: Es sei erstens v minimal und wu ¢ U,-. ; dann ist u = > 2:-yi, wobei 
die x; , y; homogen-dimensional sind, mit 


5(2,) + 5(y;:) —s, 6(z;) > 0, 5(y;) > 0, 


also 
0 < (yi) < s = d(v); 


hieraus folgt y;-v = 0 fiir jedes 7, also u-v = 0. Es sei zweitens v homogen 
s-dimensional, aber nicht minimal; dann gibt es ein homogen-dimensionales y 
mit 0 < 6(y) < sund y-v ¥ 0, und nach dem Poincaré-Veblenschen Dualitats- 
satz gibt es dann weiter ein homogen-dimensionales x mit 6(z) = d(y-v) und 
.a-y-v € 0; da d(x) = n — S(y) — 4(v), also d(zy) = n — s ist, ist r-y € Un-s ; 
somit ist v nicht Annullator von lUl,_, . 

Aus der damit bewiesenen Charakterisierung der minimalen Elemente als 





d.h. gleich der Dimension der maximalen Abelschen Untergruppen. Die Aufgabe, die 
Gleichheit zwischen dem Rang einer Gruppen-Mannigfaltigkeit—in dem Sinne, wie wir l 
oben fiir jede Mannigfaltigkeit M erkliart haben—und der genannten Dimensionszahl auf 
modglichst rein geometrischem Wege aufzuklaren, habe ich in einer Arbeit behandelt, die 
in der Com. Math. Helvet. 1941 erscheint. 

31 Alle homogen 1-dimensionalen Elemente sind offenbar minimal. 
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Annullatoren ist erstens ersichtlich, da® sie eine additive Gruppe &, bilden— 
(dies kann man auch direkt im Anschlu8 an die Definition in Nr. 32 leicht 
feststellen)—, und zweitens erkennt man mit Hilfe des Poincaré-Veblenschen 
Dualititssatzes jetzt ohne Miihe, dai der Rang dieser Gruppe &%, gleich 
Pn» — Qn-s (Nr. 30), also gleich J, ist. Damit ist der folgende Satz bewiesen, 
der eine neue Charakterisierung der Zahlen /, enthilt: 

Die s-dimensionalen minimalen Elemente von R(M) bilden eine additive Gruppe 
vom Range l, . 


34. Hin Invarianzsatz. Die Eigenschaft der “Minimalitit’ ist invariant 
gegeniiber beliebigen stetigen Abbildungen; genauer: M und M’ seien beliebige 
Mannigfaltigkeiten, und F sei eine Abbildung von M in M’; die dadurch bewirkte 
Abbildung von §(M) in #R(M’) nennen wir ebenfalls F. Dann ist fiir jedes 
minimale Element v von R(M) das Bild F(v) minimales Element von R(M’). 

Beweis: x sei ein homogen-dimensionales Element von #t(M/), und das Ele- 
ment F(x) von ®(M’) sei nicht minimal; wir haben zu zeigen, da® x nicht 
minimal ist. Falls d(x) = 0 ist, ist nichts zu beweisen; es sei d(x) > 0; dann 
ist, da die Abbildung F von 8(M) in R(M’) dimensionstreu ist, auch 
d(F(x)) > 0; da F(x) nicht minimal ist, gibt es ein solches homogen-dimen- 
sionales Element z von R(M’), daB 0 < 6(z) < d(F(x)) = d(x) und z-F(z) ¥ 0 
ist. Bezeichnet & den Umkehrungs-Homomorphismus von F (Nr. 25), so folgt 
aus (4.1), daB auch (z)-2 ¥ 0 ist; dies bedeutet, da nach Nr. 25, 3), 6(@(z)) = 
5(z), also 0 < 6(@(z)) < d(zx) ist: x ist nicht minimal. 


35. Sphdrenbilder. Fiir die s-dimensionale Sphire S, ist dasjenige Element 
von #(S,), das durch den Grundzyklus reprisentiert wird—also das Eins- 
EKlement—minimal; daher ist in dem soeben bewiesenen Satz der folgende ent- 
halten—(statt M’ schreiben wir M)—: Fiir eine beliebige Mannigfaltigkeit M 
ist en Element von R(M), das durch das stetige Bild einer Sphdre in M repra- 
sentiert wird, stets ein minimales Element.” 

Hieraus folgt weiter, da nach Nr. 33 die Gruppe %, den Rang /, hat: Ist 1, = 0, 
so ist in M jedes stetige Bild der s-dimensionalen Sphdre homolog 0.” 


36. Anwendung auf T-Mannigfaltigkeiten. Nach Satz Ib gibt es in einer 
I'-Mannigfaltigkeit kein maximales Element z mit geradem 6(z); daher sind die 
Ringe l, fiir alle geraden s gleich 0; aus Nr. 35 folgt mithin der Satz III (Nr. 9).” 

Fir ein Sphirenprodukt 


Bug x Su x ++ & Bangs d(Sm,;) =m, 


gibt die Zahl 1, an, wieviele der Zahlen m; gleich s sind; dies folgt aus der Charak- 
terisierung der 1, am Schlu® von Nr. 31 und der Tatsache, die man leicht 





*? An die Stelle einer wirklichen Sphire darf offenbar auch eine Homologie-Sphire (in 
bezug auf den rationalen Koeffizientenbereich) treten, d.h. eine Mannigfaltigkeit, welche 
dieselben Bettischen Zahlen hat wie eine Sphire. 
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bestitigt, dafB8 die Elemente Z;, --- , Z:, die in Nr. 3, Formel (*), angegeben 
sind, ein irreduzibles Erzeugenden-System bilden, und daf 6(Z;) = m, ist. 
Fir eine T-Mannigfaltigkeit I gibt also J, an, wieviele Faktor-Spharen in dem 
Sphirenprodukt, das einen mit T isomorphen Ring hat, s-dimensional sind. 
Hierin ist enthalten, da die Exponenten |, in Nr. 4, Formel (1’), mit unseren 
Rangen I, iibereinstimmen. 


37. Die Pontrjaginsche Multiplikation und eine Vermutung. Die Grundlage 
fiir Pontrjagins Untersuchung der Homologie-Eigenschaften geschlossener 
Liescher Gruppen” ist die folgende “Produktbildung”: In der Mannigfaltigkeit 
M sei eine stetige Multiplikation erklart; « und y seien zwei Zyklen in M; 
werden x und y von zwei Punkten p bzw. q durchlaufen, so durchlauft das stetige 
Produkt pq einen Zyklus, den wir mit x O y bezeichnen wollen; variieren x 
und y in ihren Homologieklassen, so indert sich die Homologieklasse von z O y 
nicht; wir diirfen daher x O y als Produkt zweier Homologieklassen x und y 
deuten; in unserer Ausdrucksweise aus Nr. 26 ist 


zrOy= F(x xX y). 


Diese Produktbildung ist mit der Addition der Homologieklassen offenbar dis- 
tributiv verkniipft; nehmen wir nun weiter an, die stetige Multiplikation in M 
sei assoziativ, dann wird auch die Produktbildung z O y assoziativ sein. Somit 
sieht man: Fiir eine Gruppen-Mannigfaltigkeit T la&t sich die additive Gruppe 
der Homologieklassen nicht nur durch die Schnittbildung z-y zu dem Ring R(L), 
sondern auferdem durch die Pontrjaginsche Produktbildung x O y zu einem 
Ring $(t) machen. 

Die Bedeutung des Ringes (I) liegt auf der Hand: in seiner Struktur duBert 
sich die Wirkung der Gruppen-Multiplikation auf die Zyklen—also ein Zusam- 
menhang zwischen den algebraischen Eigenschaften der Gruppe einerseits, den 
geometrischen Eigenschaften der Mannigfaltigkeit andererseits. Die Unter- 
suchung dieses Ringes ist daher gewif eine wichtige Aufgabe. Ich will hierzu 
eine Vermutung dufern, die sowohl durch Pontrjagins Ergebnisse, als auch 
durch unseren Satz I nahegelegt wird.” 

Auch in jedem Spharen-Produkt 


H = Sm, X Sig X ++ X Sm; 


lat sich neben der Schnittbildung in naheliegender Weise noch eine zweite 
Multiplikation, die “Aufspannung”’, erkliren. Wie in Nr. 3 bezeichnen wir den 
Grundzyklus der Sphire S,, selbst mit S, und mit p immer einen einfach 
gezihlten Punkt; wir stellen den Elementen (*) aus Nr. 3 die folgenden Ele- 
mente gegeniiber: 

Vi= Sn, XpXpX--- Xp, 


(**) Ve = p X Sm X PX --+ XP, 


Vi=pXpXpxX-:-:: X Sm,. 
33 Man vergl. zum Folgenden: Cartan’, (b), 25-26. 











GRUPPEN-MANNIGFALTIGKEITEN UND IHRE VERALLGEMEINERUNGEN 51 


Dann definieren wir zunachst fir 
bins: <& 
das von Vi,, Vi,,--- , Vi, aufgespannte Element 
(5.1) Vi, @ Vi, @--- @Viy =X KX XK --- KX 4X 
durch die Festsetzung: 
z; = Sm, fir] = 1,%,---,%&, 
x; =p _ fir alle anderen 7. 


Durch (5.1) sind 2’ Elemente erklart, die eine volle additive Basis bilden. Fir 
je zwei Elemente (5.1) 


(5.2) X=Vi,@Vi®@-:-@Vi, Y=V;i,@V;,®@--: @Vi, 


definieren wir das aufgespannte Element X @ Y dadurch, daf wir die rechten 
Seiten von (5.2) formal nach dem assoziativen Gesetz miteinander multiplizieren 
und die Regeln 


V; @ Vi = (-1)""V; ® V;, V;@V;,=0 


anwenden; dann wird entweder X @® Y = 0 oder +X @ Y wieder ein Element 
(5.1). SchlieBlich erkliren wir fiir beliebige Elemente X, Y das Produkt X @® Y 
dadurch, daf8 wir X und Y als lineare Verbindungen mit rationalen Koeffizienten 
der Elemente (5.1) darstellen und die distributiven Gesetze anwenden. 

Durch die damit erklirte Multiplikation wird die additive Gruppe der 
Homologieklassen des Spharenproduktes II zu einem Ring O(II) gemacht. 
Die Struktur dieses Ringes ist leicht zu iibersehen: man stellt leicht fest, daB 
die Ringe #(II) und Q(I1) miteinander isomorph sind, und daf dieser Iso- 
morphismus durch eine gewisse Dualitaét vermittelt wird, die sich zunichst darin 
fiuSert, da einem Element x von (II) immer ein Element y von Q(II) mit 
d(y) = 6(x) entspricht. 

Die oben erwahnte Vermutung beziiglich des Pontrjaginschen Ringes $(T) 
einer Gruppen-Mannigfaltigkeit T ist nun die folgende: 

Die, auf Grund des Satzes I mégliche, isomorphe Abbildung der Ringe R(T) 
und (Il) aufeinander lat sich so wahlen, daB sie zugleich die Ringe B(T) und 
Q (II) tsomorph aufeinander abbildet. 

Damit ware die Struktur des Ringes (I) sowie die Beziehung zwischen den 
Ringen $(I') und R(I) weitgehend geklart. 

Unter anderem wiirde noch die folgende Rolle der minimalen Elemente einer 
Gruppen-Mannigfaltigkeit sichtbar werden. Offenbar sind die Elemente 
Vi,---, Vi, die durch (**) gegeben sind, minimale Elemente von (II), und 
zwar bilden sie eine additive Basis der vollen Gruppe der minimalen Elemente 
von R(II), d.h. der Vereinigung der Gruppen %:,---, B, (Nr. 33); daher 
wirden auch die Elemente 1, --- , v: von R(T), welche bei dem vermuteten 
lsomorphismus den V; entspriichen, minimal sein und eine Basis der vollen 
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Gruppe der minimalen Elemente von (I) bilden; da aber die Elemente (5.1) 
eine volle additive Basis in II bilden, wiirden auch die Pontrjaginschen Produkte 
Vix O Vig O alas O Viz » 

ty <dgees Kk, 


eine volle additive Basis in T bilden. Es wiirde sich also herausstellen, dai 
die minimalen Elemente 1 , v2 , --- , ¥: in ganz analoger Weise durch die Pontr- 
jaginsche Produktbildung den Ring $(C) aufspannen, wie die maximalen 
Elemente 2, 22, ---, 2: durch die Schnittbildung den Ring R(I) erzeugen.™ 


Ziricu, SWITZERLAND. 





% Die oben ausgesprochene Vermutung, fiir die ich keinen Beweis gefunden hatte, habe 
ich miindlich Herrn H. Samelson mitgeteilt, und dieser hat ihre Richtigkeit inzwischen in 
vollem Umfange bewiesen.—Sie bezieht sich iibrigens ausdriicklich auf Gruppenmannig- 
faltigkeiten, und fiir den Beweis ist die Benutzung des assoziativen Gesetz wesentlich. 
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ON THE CARTAN INVARIANTS OF GROUPS OF FINITE ORDER* 


By Ricuarp BRAUER 
(Received April 18, 1940) 


1. INTRODUCTION 


E. Cartan, in his fundamental paper on hypercomplex numbers,’ introduced 
an important set of invariants ca , (x, A = 1, 2, --- , k) of an algebra A with a 
principal unit 1. Here, & is the number of prime ideals 8, of A. The cy are 
non-negative integers which also play an important réle in the decomposition 
of the regular representation of A. 

Let us consider now a semisimple algebra I of rank n over an algebraic number 
field K, and let J be an integral domain’ of f. Every prime ideal p of K gen- 
erates an ideal p,; of J. The nature of this ideal is determined by the structure 
of the residue class ring J/p,. This ring can be considered as an algebra over 
the residue class ring 0/p, where o denotes the domain of all integers of K. 
Hence, we may form the Cartan invariants ca(p) of J/p,. We have in this 
case Ca(p) = Cxx(p), and the ca(p) are the coefficients of a non-negative quadratic 
form ¥ = >> ca(p)aa . 

In particular, these notions can be used in the case of the group ring [ of a 
group @ of finite order g. As field of reference, we choose an algebraic number 
field K, such that all the absolutely irreducible representations of © can be 
written with coefficients in K. The linear combinations of the group elements 
with integral coefficients in K form an integral domain J of T. Hence, we may 
form the Cartan invariants c,(p) for every prime ideal p of K. It appears that 
they actually depend only on the rational prime p which is divisible by p». 
Accordingly, we denote them by ca(p).* If p is not a divisor of the group order 
g, then the matrix C(p) = (ca(p)) is the unit matrix 1, i.e. ca(p) = ba. We 
therefore restrict our attention to the case where p divides g, in which case p 
is a divisor of the discriminant of J. To every such prime p, we obtain in ¢a(p) 


s 





* Presented to the American Mathematical Society on April 26, 1940. 

‘ E. Cartan, Annales de Toulouse, 12 B, (1898), p. 1. Cf. further R. Brauer, Proc. Nat. 
Acad. Sci. 25, (1939), p. 252. 

2 By an integral domain (Ordnung) J of I, we understand a subring / of I with the 


following properties: (a) J contains all the integers of K; (b) The rank of J is n; (c) The 
ay 
elements of J, when expressed by a basis « , «2 , -*:, én of J. have the form a = i oo 
where the a; are integers of K and w is a fixed integer of K which is independent of a. 
* Cf., for instance, A. Speiser, Theorie der Gruppen von endlicher Ordnung, 3 rd ed., 
Berlin 1937, theorem 181, p. 204. : 
‘ For the properties of the cxa(p), in this case, cf. R. Brauer and C. Nesbitt, University 


of Toronto Studies, Math. Series No. 4, 1937, and a paper forthcoming in the Ann. of Math. 
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a set of invariants of the group G, which are of great importance for the theory 
of group characters. The aim of this paper is the determination of the dis- 
criminant | C(p)| of y. We prove 

THEOREM 1: The determinant | ca(p) | of the matrix of Cartan invariants of a 
group & of finite order is a power of p. 

The exact exponent of p in | ca(p) | is given below in theorem 1*. 


2. PREPARATIONS FOR THE PROOF 


We denote by ©, G:,---, G, the classes of conjugate elements of the 
group @, which consist of p-regular elements.” Let g = g/m be the number 
of elements in ©, , so that m is the order of the normalizor of an element of © . 
To each class © , there corresponds a reciprocal class G,+ containing the recipro- 
cals of the elements of GQ. We have, then, k absolutely irreducible modular 
characters, ¢"’, g®, --- , e® of @ (mod p); we may arrange the values ¢\” of 
y” for the class G in the form of a matrix 


= (gx) (k, A= 1, 2,+-: , k). 


The number k here also gives the degree of the Cartan matrix C = (c,a(p)) 
and the matrices ® and C are connected by the formula 


(1) ®’C& = (n,bas),° 


: 1) 2) k ° P 
provided that g’, »”, .-- , e™ are taken in a suitable arrangement. On form- 


ing the determinant in (1), we obtain 
(2) \@|?|C| = zm --- nme. 


This shows that |C | # 0. Hence, the non-negative quadratic form yw is 
positive definite, ie. |C| > 0. Further, the determinant || is prime to p.’ 
Therefore, theorem 1 will be proved when we can show 

THEOREM 2: Let G,, G:,---, G be the classes of p-regular, conjugate ele- 
ments of &, and let g, = g/ny be the number of elements of ©. If ® is the matrix 
of the modular group characters of @(mod p), then the square of the determinant 
| @ | is given by 


(3) |@P = + a es 


where p* is the highest power of p dividing nynz --- ny . 

At the same time, we obtain 

THEOREM 1*: The determinant of the Cartan matrix, (ca(p)), is equal to p*, 
where p* has the same significance as in theorem 2. 

For primes p which do not divide the order of G, theorem 2 is trivial. Here, 





’ By a p-regular element of G, we understand an element whose order is prime to p. 
6 Cf. the formulae (29) and (15), respectively, of the papers mentioned in ¢. 
7 Cf. the formulae (26), (17), respectively, of the papers mentioned in ‘. 
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k is the number of all the classes of conjugate elements of @, and p* = 1. The 
relation (3) is obtained by multiplying ®’ and 9, using the orthogonality relations 
for group characters. In the same manner, the analogous formula for ordinary 
group characters (instead of modular characters) can be obtained at once. 

In order to prove theorem 2, it is sufficient to show that if gq # pis a rational 
prime, and if ¢° divides the right hand side of (2), then ¢° divides |@|°. We 
shall prove that by proving a similar statement for certain minors of &. We 
first have to give some simple group theoretical considerations. 

Let A be an element of © such that the order of A is prime to pand g. We 
shall say that an element G of © contains A as its q-regular factor, if G is of 
the form G = AQ, where the order of Q is a power q’ 2 1 of g, and where AQ = 
QA. Of course, A and Q are uniquely determined by G; both can be written 
as powers of G. If G, is conjugate to AQ, then the g-regular factor of G, is 
conjugate to A. 

Let Ai, Ae, --- , Am be a maximal system of elements of G, such that A;, 
A; are not conjugate for 7 ¥ j and the order of each A; is prime to p and gq. 
With each A;, we associate those classes of conjugate elements, G{’, Gi’, 
... , Gf, which contain elements with A; as their g-regular factor. Each of 


the classes, ©;, G2, --- , G;, then appears exactly once in the form 6°". By 
expanding | ® |, we see that |®| is a sum of terms 

(4) TiT2 paces Tn ’ 

where 7; is a minor of degree h; of &, containing only the columns which belong 
to G1”, G3", ---, Gf. 


We now state 

THEOREM 3: Let A be an element of an order not divisible by the two primes p 
and q, and assume that the h classes, ©, , G., --- , ©, are all the classes of con- 
jugate elements of the group ©, which contain elements G with A as their qg-conjugate 
factor. If o, eo, --- , © are any h modular characters (mod p) of G, then 


(r) (r) (r) 
r fee eo! 


Yp Yo 
(8) (8) (8) } 
(5) | A | = | Pp Ve re =0 (mod (p90 ++ G)"), 
oo... of 


where gy is the highest power of q dividing n .* 

Each 7; in (4) has the form (5) (for A = A;). From theorem 3, it follows 
that the expression (4) is divisible by (qq --- q)', and, hence, that || is 
divisible by the same number. It is therefore sufficient to prove theorem 3 in 
order to prove theorems 1 and 2. Changing the notation, if necessary, we may 
assume without restriction that 

p=le=2,---,r=h, 
r=zle=z=2,.--,tzh, 





* An analogous theorem holds for ordinary group characters. Here, the assumption 
that the order of A is prime to p is not necessary. The proof is the same as for theorem 3. 
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and then (5) assumes the form 


yg), ’ eh | , 
(6) [A] =] -cccerceeoeee |= 0 (mod (qig: -+- g)’). 
(h) (h) | 
Yl ; » Ph 


The proof of (6) will be given in §3. First, we must formulate and prove a 
group theoretical lemma. Let AQ; , AQ, --- , AQ, be representatives for the 
h classes ©, G2, --- , G,, where A is the q-regular factor of AQ; , and where 
Q:; = 1. Let M be the normalizor of A in G, and let Q be a Sylow subgroup 
of % belonging to the prime g. Then 7 is the order of Jt and q: the order of 
©. Each AQ; will commute with every element of a certain subgroup 9; of 
order g; of ©. Since A and Q; both are powers of AQ; , each of them com- 
mutes with every element of Q;. 

In particular, we have Q; © N. Replacing Q; by an element N7z'Q:N;, 
with N; in Jt, we may assume that 


(7) QO: |S Q, 


as follows easily from Sylow’s theorem. Since Q; must belong to 0;, the 
element Q; itself will belong to Q. If Q is any element of O, then AQ will be 
conjugate in © to some AQ,, ie., @ 'AQG = AQ;. Raising this equation 
to suitable exponents, we obtain G'AG = A, G@’QG = Q;. Therefore, Q 
and Q; are conjugate in Yt, and hence Q; , Q: , --- , Q, form a complete system 
of representatives for those classes of conjugate elements in Jt, in which the or- 
ders of the elements are powers of gq. 

In Q, the elements Q: , Qe , --- , Q, need not form a complete system of repre- 
sentatives for the classes of conjugate elements. However, we may construct 
such a system by adding further elements Q to the set Q:, Qo, --- , Qs. Each 
Q will, in 2, be conjugate to a certain Q; where 7 is uniquely determined, 7 = 
1, 2,.-., h. We denote the elements Q belonging to Q; by Q; = Q{°, Q°, 
Qy,..- , Qf, (li; = 0). Let gf” be the highest power of g dividing the order 
of the normalizor of Q§” in Q. According to (7), we have 


(8) qi = qi. 
We now prove the 
Lemma: The numbers qi”, (A = 0, 1, 2, --+ , l;) are divisors of qi. If exactly 
d; of them are equal to q; , then 
(9) d; 0 (mod q). 


ProoF: Let ®; denote the class of conjugate elements of Xt, which contains 
the element AQ;. The number M; of elements in &; is equal to the order of Jt 
divided by the order of the normalizor of AQ;. Hence 


(10) M, = 2 Mm, with (M;,q) = 1. 








_ gg cos 2 natn fotos ee | OCOoelCU DO 
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The class ®; can be broken up into partial classes &{”’, where each &{” con- 
sists of elements which are conjugate by means of transformations by elements 
of QS R. The elements AQ~”, AQ!”, --- , AQS"® will each determine such 
a partial class, but there may be further partial classes which do not contain 
elements AQ with Qin Q. In any case, if A TS” is an element of &{”, and if 


T’” commutes with exactly w{” elements of Q, then the number M$” of ele- 


ments of ®{” is given by 
(11) MY = 


=. 
w" 


We have, of course, 


(12) M; = > M. 
7 


In G, the elements AQ; and A7$” are conjugate. Hence, the order of the 
normalizor of A7'$” in @ is divisible by q; but not by a higher power of g. On 
considering the subgroup generated by ATS” and the w{” commuting elements 


of Q, we readily see that w{” < q; and that the equality sign can hold only 


if TS” belongs to Q. When Tf” = Q{”, wi” = g{”, and we thus obtain the 
first part of the lemma. If wi” = q;, then the partial class &{” contains 
exactly one element Q{”, (A = 0, 1, 2, --- , l;), and we have g§” = q;. Accord- 


ing to our assumption, there are exactly d; such partial classes. Therefore, d; 
of the numbers M$“’, (cf. (11)), are equal to qi/q:, the remaining ones being 
divisible by a higher power power of g. Then (12) gives 


M.=4;% (moa *) 
qi % 
and, on comparing this with (10), we obtain (9). 


3. PRooF OF THE THEOREMS 


As we have seen, it is sufficient to prove (6). If § is any representation of G, 
we may assume that the matrix §4 representing the fixed element A appears 
in canonical form, i.e. 


ay I, 
A sit oa — aol,, 
Where a , ag, --- are distinct roots of unity and 1, , v2: , -- - are positive integers. 
The matrices §, representing elements Q of Q, then break up in the form 
Vi 


Q- Fe = V2 
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where V; is of degree v;. For a fixed j, the matrices V; form a representation 
B,of Q. Leto, 9, ... , 8 be the distinct, irreducible characters® of OQ. 
Then m is the number of classes of conjugate elements in Q, i.e., the number 


of elements Q{”’. If x denotes the character of §, we readily obtain 


x(AQ) = 2 248% @) 


where the z, are algebraic integers which are independent of Q. We set, 


: accordingly, 
ey 


i (13) e°(AQ) = Li zd” (Q). 


. ° ) . (») ° P 
In particular, we use this for Q = Q{”. Since Q; and Q{” are conjugate in MN, 
we have 


(14) y» Ze,” (Q;) ai > Zu0” (QS). 


We now introduce matrix notation. Let us denote by © the matrix (8 (Q$”)) 





of the characters of ©; the rows here are fixed by one index x, (x = 1, 2, --- , m), 
; the columns by the two indices 7, v, (¢ = 1, 2,---,h; v = 0, 1, 2,---, Uj). 


We arrange the columns so that first the h columns with v = 0 appear and then 
the m — h other columns. Thus, 


ie) = (Oo ’ 0;) 





where 0p is of type (m, h)"° and ©, of type (m, m — h). 
Since the notation in (6) was chosen such that the class G, contained AQ, , 
we-have gx” = y” (AQ). On account of (13), the matrix A in (6) can be 


written in the form 


A = ZOy 


where Z is the matrix (Za), (ck = 1,2,---,h;X = 1, 2, --- , m), of type (h, m). 
In order to replace the matrices by square matrices, we set 


2-() 


where U’ is a matrix of type (m — h, m) with arbitrary integral coefficients. 


Then 
> Z ZQo Ze ri A es 
= (7) (0, @) = ‘pod wien ¥ a Ue)" 


Here we subtract every column (7, 0) from all the columns (7, \), with A > 0, 
and with the same first index. Then, according to (14), Z@; on the right hand 


















* Since the order of © is prime to p, there is no difference here between the ordinary and 
the modular characters (mod p) of Q. 
10 i.e., a matrix with m rows and h columns. 
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side will be replaced by 0. If Of is obtained from 0, by subtracting from each 
column (i, A), with A > 0, the column (7, 0) of @ , then, by taking the deter- 
minant, we find 

|Z||@| = |A|| Ver |. 
Here, | Z| is an algebraic integer, and | 6 | has the value 


h ly 
jo! =] IT @)*." 


i=l »v=0 
On account of (8), we obtain 
(15) |A|| Ver | = 0 (mod (qq --- ge) (Ii [a0 g?”)?). 


Let us assume now that the formula (6) does not hold. Since all the g{”’ are 
powers of q, it follows that 


(16) | Ver | = 0 (mod (q TT; [],50 @?)}), 

for any choice of U’. Taking a suitable U, we see that any minor of degree 
m — h of @; can be obtained in the form | Ver |. But the determinant 
| (0T)/O7 | is equal to the sum of the squares of all these minors. Hence, 

(17) | (0r)’0r | = 0 (mod q [Ti [],>0 9%). 

Any row in (er)’ , the transpose of er , is characterized by a pair of indices, 7, u, 
(( = 1, 2,---,hj w = 1, 2,---,1;), and any column is characterized by an 
index x, (x = 1,2,---,m). The rows of (9;)’0; are given in the same manner, 
and each column is characterized by a pair of indices j, v, with 7 = 1, 2, --- ,h; 


v= 1,2,.---,1;. For the element y(i, uw; 7, v) at the place (7, u), (j, v) in 
(or)'er , we obtain easily 


uli, wi») = D (QP) — 0°(Q)) (0° (@") - 0°(Q)) 


on account of the definition of @; . The sum here splits into four sums, each of 
which can be computed by means of the orthogonality relations for the group 
characters of Q. We set 6(R, S) = 1, if the elements R and S“ of Q are 
conjugate in Q, and in the other case we set 6(R, S) = 0. Since the normalizor 
of Qi” in Q has the order q”, we find 


y(t, u; J, v) 
= 6(Q$”, Q')qh” — 6(Q:, Qf’ )ai — 5(Q!”, Q,)ai” + 46(Q:, Qiqi 


For each 7, this expression vanishes unless Q;' and Q; are conjugate in G, since 
otherwise Q!” and Q‘” could not be conjugate in Q. Consequently, there is 
only one value 7 = 7* for a given i for which the expression can be different 


(18) 





'' This is the analogue of theorem 2 for ordinary group characters, and, as remarked in 
connection with theorem 2, this analogue is trivial. 








1) pain 
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from 0, and we have l; = lj. This shows that the determinant (17) splits into 
a product of m determinants 


(19) | @r)er | = +] MQ |-| MQ]. --- -| Qe], 


where 
2, 1* fixed, u row index, 
(20) iQ:| = | y@, wu; 7, ») |” 
v column index, p, v = 1, 2,--- , i. 


According to the lemma in §2, each q; is divisible by ¢§”; we set, accordingly, 
(21) y(t, w, *, ») = gi” ai(u, »). 

Then 

(22) 12; | = gig? --. gf? | Xi, 

where 

(23) X; = (xi(u, v)), (u row-index, v column-index). 
Using (17), (19), and (22), we obtain 

(24) | Xi || X2|---| Xu | =O (mod q). 


Because of (18) and (21), the matrix X; is the sum of four matrices X!”, X?, 


X{, X. In each case we have 7 fixed, and we denote the row-index by u, 
the column index by ». 


x? = @(@",0%), xP = -(2(@, a8) *), 
(25) qi 





x® _ —(3(Q*”, Qix)), x? = (1, Qi) =) 


\ 


As the lemma in §2 shows, we have d; — 1 values for » = 1, for which qi” ’ 


We may assume that these are the values u = 1,2, ---,d; —1.% Fory2= 
we have 


(26) qi/qi” = 0 (mod 4). 


Three cases must be considered separately. 

Cask I: 7 ¥ 7*. 

We may then assume that Q$”* = Q for all u. Then X$” is the unit 
matrix, X$? = X$° = 0, whereas in X$" each coefficient in the u-th row is equal 
to qi/qi”. Hence, (mod q), the matrix _— has d; — 1 rows consisting of 1, 
and the other rows are all 0. Then 


XOX = d-— 1X, tr (Xf) =a; - 1 (mod q). 





122 Tf 1; = 0, we must set | Q; | = 1. ; 
13 For d; = 1, the corresponding kinds of rows of X$?), do not occur. 
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This shows that (mod q) one characteristic root of X has the value d; — 1, and 
that the others have the value0. Then the characteristic roots of X; = I + Xf 
are given by d;, 1,1, ---,1 (mod gq). Hence 


(27;) |X;| =d; (mod q). 


Case Il. i = 7*, but Q; and Qj" are not conjugate in Q. 

We may assume here that Q7' = Q{”. Then Xf? = 0. In X?, only the 
first column contains elements different from 0, and the coefficients in this 
column are given by 


—q:/ae?, —¢q:/a8”, --- , —as/ah. 


In X{°, only the first row contains elements different from 0. All the coeffi- 


cients in the first row are equal to —1. 

In X$”, the first row and column are 0. Each of the other rows contains 
exactly one coefficient 1, and all the other coefficients are 0. The same is true 
for the second, third, --- , last column. On adding all the other rows to the 
first row in X; , we obtain easily 


|X.| = (4+ 4... +4 +1). 
a" oP a 


Hence, since d; — 1 of the fractions are 1, and the other ones = 0 (mod q), 
(272) | X;| = +d; (mod q). 

Case III. i = 7*, and Q; and Qj" are conjugate in Q. Here Xf = 0, 
xX =0. Asin case I, the first d; — 1 rows in X$ contain only coefficients = 1 
(mod g), and the latter rows contain only coefficients = 0 (mod q). The matrix 
X{ can be changed into the unit matrix, if the columns are taken in another 
order; the value of X{° (mod q) is not altered hereby. The argument used in 
the first case then gives 


(273) + |X;| =d, (mod q). 


The three formulae (27) show, in connection with the lemma in §2, that in any 
case | X;| 4 0 (mod q). Then (24) is impossible. Thus, the assumption that 
(6) is not true leads to a contradiction, and the theorems 1, 2 and 3 are proved. 
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MINIMAL SURFACES NOT OF MINIMUM TYPE BY A NEW MODE 
OF APPROXIMATION 


Marston Morse Anpb C. ToMPKINS 
(Received May 31, 1940) 


1. Introduction. We begin with a study of a non-negative function f*(p) 
defined at each point of a metric space M. We suppose that f’(p) is the limit 
of a sequence of functions f"(p) also defined over M. Let p, be a homotopic 
critical point p, of f"(p). Westudy the limit points g of sequences p, , regarding 
q as a new type of critical point of the limit function f(p). 

We give a simple application. 

Let g° be a simple, closed, rectifiable curve in a euclidean space with the 
property that the ratio of an arbitrary chord length of g’ to the smaller of the 
corresponding two arc lengths of g’ is bounded from zero. This chord are condi- 
tion on g’ is less restrictive than the condition imposed by Shiffman’ in his study 
of unstable minimal surfaces. We make use of Douglas’s Dirichlet function 
f’(v). The point ¢g is a monotone transformation of the arc length along g’. 
The basic result is as follows. 

If g° bounds two minimal surfaces of disc type belonging to disjoint minimizing 
sets of such surfaces (cf. §2), then g° also bounds another minimal surface of disc 
type not of minimum type. 

Results of this nature but under more restrictive hypotheses have been ob- 
tained previously by Morse and Tompkins’ and by Shiffman, loc. cit. We have 
found that less restrictive hypotheses on g’ are adequate, because they can be 
compensated by more restrictive hypotheses on the approximating functions 
f"(p), n > 0, making the analysis of the functions f"(p) much simpler. This 
applies particularly to the study of upper-reducibility.’ For it is easy to show 
that the restricted functions f"(p) are upper-reducible, while we do not need to 
show that f°(p) is upper-reducible. 


2. The mode of approximation. Let M be a metric space with points p, 
q, r and distances pq, pr, etc. satisfying the usual metric axioms. On M there 
shall be given a sequence of functions f"(p) converging to a function f’(p). Thus 


(2.1) lim f"(p) = f’(p). 





1Shiffman. The Plateau problem for non-relative minima. Annals of Mathematics, 
vol. 40 (1939), pp. 834-854. See (2) §3. 

2 Morse and Tompkins. The existence of minimal surfaces of general critical types. 
Annals of Mathematics, vol. 40 (1939), pp. 443-472. The letters MT will be used in the 
text to refer to this paper. See corrections elsewhere in this number of the Annals. 

§’Morse. Functional topology and abstract variational theory. Mémorial des Sciences 
Mathématiques, Fascicule 92 (1939). See §8. The letter M will refer to this pamphlet. 
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This convergence is in general not uniform. The functions f"(p), m = 0, 
1, ---, shall satisfy conditions I to VI to be described. If ¢ is an arbitrary 
constant and f is a function defined on M, f, shall denote the subset of points 
of M at which f S c. Our first condition is as follows. 

Hyporuesis I. Bounded compactness. The sets fe’, m = 0,1, --- , shall be 
compact for each constant c. 

It follows from this hypothesis that each function f” is lower semi-continuous. 

Hyporuesis II. Jf lim p, = p, 


(2.2) inf. lim. f"(p,) = f°(p). 


Hyporuesis III. Corresponding to any subsequence F" of the functions f" and 
points p, such that F'(p,) is bounded independently of r, there exists a subsequence 
of the points p, which converges to a point of M. 

We shall prove the following lemma. 

Lemma 2.1. Let e and c be positive constants. If n is sufficiently large, and b 
is a constant sufficiently near c, ft ts on the e-neighborhood N, of gi. 

Suppose the lemma false. There will then exist a constant e > 0 and a sub- 
sequence F” of the functions f” together with points p, ,r = 1, 2, --- , such that 
p, is not on N,, while 
(2.3) c = sup lim F’(p,). 

By virtue of Hypothesis III a subsequence of the points p, converges to a 
point g of M. 


Without loss of generality we may suppose that p; , pz, - - - is this subsequence. 
By hypothesis g is not on N, so that f°(q) > c. But it follows from II that 
(2.4) inf lim F’(p,) 2 f°(q). 


From (2.3) and (2.4) it follows that c = f°(q). From this contradiction we infer 
the truth of the lemma. 

Hypotruesis IV. Corresponding to each compact subset A of M, there exists 
a constant x* such that for each integer n, f" < xf? on A. 

We shall make use of chains and cycles taken in the sense of Vietoris with 
coefficients in the field of integers mod 2. Cf. M, §1. If pis a point of M, p* 
shall represent a 0-cycle each of whose component 0-cycles is on p. This con- 
vention will be permanent. The following lemma is an immediate consequence 
of our definitions. 

LemMA 2.2. A necessary and sufficient condition that two points p and q belong 
to the same component of a compact subset A of M is that p* + q* ~ Oon A. 

Let f be a function defined on M. A set A will be termed minimizing relative 
to f if f equals a constant ¢ on A, and if there exists an e-neighborhood A, of A 
such that f > C on A, — A. If f satisfies Hypothesis I, each minimizing set 
w of f is closed. For at a limit point qg of w, f(q) < ¢ by virtue of the lower 
semi-continuity of f. It follows from the definition of a minimizing set g that q 
belongs to w. Hence w is closed. 
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A subset of M on which f is bounded will be termed f-bounded. We shall 
prove the following theorem. 

THEOREM 2.1. Let y and z be points respectively of two disjoint minimizing 
sets w, and w, of f°. If y and z are connected by an f’-bounded subset of M and 
Hypotheses I to IV are satisfied, there exists a subsequence F" of the functions f" 
with the following properties. There exists a least value u, of ¢ for which 


p+ 9 ~0 (on F), 
with 
Mr > Max [F’(y), F'(z)\, 


while the numbers yu, converge to a limit uw such that 


w > max [f'(y), f°(2)]. 


We begin the proof of this theorem with certain lemmas. The first lemma 
concerns a compact subset C of M. A proof may be found in Morse,‘ page 430. 

Lemma 2.3. If u is a k-cycle on C such that u ~ 0 on C, for each positive e, 
then u~ 0 on C. 

We shall prove the following lemma. 

Lemma 2.4. Under the hypotheses of the theorem there exists a least value 
vo of c such that y* + 2* ~Oonft. 

It follows from the first hypothesis of the theorem and from Lemma 2.2 that 
there is at least one value of c such that y* + z* ~Oonfe. Let be the greatest 
lower bound of such values of c. There then exist values of c arbitrarily near 
vy such that y* + 2* ~0onf?. But it follows from Hypothesis I that if c is 
sufficiently near 7 , fe lies on an arbitrarily small neighborhood N, of fy,.. Hence 
y*+2z*~OonN.. Weset C = fy,. Wesee that y* + 2*isonC. It fol- 
lows from Lemma 2.3 that y* + z* ~ 0 on C, and the proof of Lemma 2.4 is 
complete. 

The points y and z are connected on f),. But y and z belong to disjoint min- 
imizing sets. These sets are compact, and hence at a positive distance from 
each other. It follows that 


(2.5) vm > fly), wm > f(z). 


The set f,, is compact. Hence on this set in accordance with IV, f" < x, 
where « is a constant. Hence y* + 2* ~0Qonff,,. Let», be the least number 
c such that y* + 2* ~Oonf?. That », exists follows as in the proof of Lemma 
2.4. We see that 


(2.6) Vn S KM% (n = 1,2,---). 





‘Morse. Rank and span in functional topology. Annals of Mathematics, vol. 41 (1940), 
pp. 419-454. 
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Lemma 2.5. If v, is the least number c such that y* + z* ~ O onfi,n= 
0,1, --- , and of wu = inf lim »,, then 


(2.7) uM = Vo. 
Let yu, be a subsequence of the numbers », such that 
(2.8) lim pw, = p. 


Let F’ be the corresponding subsequence of the functions f" so that y* + z* ~ 0 
on F%,. Let e be an arbitrary positive constant, and set f; = C. It follows 
from Lemma 2.1 that F;, C C., provided r is sufficiently large. Hence y* + 
z*~OQonC,. It follows from Lemma 2.3 that y* + z* ~0OonC. From the 
definition of v we infer that 4 = » , and the lemma is proved. 

We shall now prove Theorem 2.1, making use of the sequence F” and the 
numbers u, of the proof of Lemma 2.5. Observe that | 


u > max [f*(y), f*()] 


in accordance with (2.5) and (2.7). Recall that F’ and u, converge respectively 
tof’ and ». Hence if p is a sufficiently large integer 


ur > max [F"(y), F’(z)] (r = p). 


Theorem 2.1 is accordingly satisfied by sequences F” and yu, , starting with in- 
tegers r = p. 


3. h- and fh-critical points. Let / be a function defined on M. We refer 
to the definition of an F-deformation and a homotopic critical point of F (written 
h-critical point) as given on page 30 of M. Let F’, F’, --- be a subsequence 
of the sequence f’, f’, --- converging to f°, and let g, be an h-critical point of 
F’. Any limit point q of the point set g, will be termed an fh-critical point of 
f°. This term is relative to the sequence f”, to which the letter f refers in the 
term fh-critical point. 

Lemma 3.1. The set of fh-critical points of f° is closed, and the set of h-critical 
points of f° is closed among points at any given level p. 

That the set of h-critical points of f° is closed at the level » follows at once 
from the definition of an h-critical point. 

To prove that the set of fh-critical points of f° is closed, let p be a limit point 
of a sequence p, of fh-critical points of f°. For each fixed r, p, is by definition 
the limit of a sequence p(r, s), s = 1, 2, --- , of h-critical points of functions 
f*" where the integer n(r, s) increases with s. Hence if integers 8; , 82, -- 
are successively chosen with s, sufficiently large, the sequence p(r, s,) will con- 
verge to p as r becomes infinite, while the integers n(r, s,) will increase with r. 
Hence p is an fh-critical point of f°, and the proof of the lemma is complete. 

To continue we need two additional hypotheses. We refer to MT, page 445, 
for the definition of weak upper-reducibility. 
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Hyporuesis V. T'he functions f', f’, --- shall be weakly upper-reducible. 

Lemma 8.1 of M clearly holds if weak upper-reducibility replaces upper- 
reducibility. We shall use Lemma 8.1 of M in proving the following lemma. 
We refer to Theorem 2.1 and the constants yu, uw, and functions F” described 
therein. 

Lemma 3.2. The function F" of Theorem 2.1 assumes the value p, tn at least 
one h-critical point w, such that y, z, w, are in the same component of F;,, . 

Let « be the component of FZ, which contains y and z._ The set « is compact. 
Suppose there is no h-critical point of F;, on «x at the level u,. By virtue of 
Lemma 8.1 of M there will then exist an F’-deformation of « into a set definitely 
below u,. Hence y* + z* ~ 0 on F% for some c < yu,, contrary to the defini- 
tion of pu, . 

We infer the truth of the lemma. 

Hyrotuesis VI. (a). Let p, be an h-critical point of f"", mn < m <---, 
such that p, converges to p as r becomes infinite. Then f"’(p,) shall converge to 
f'(p). (b) Let p, be an h-critical point of f° such that p, converges to p. Then 
f’(p,) shall converge to f’(p). 

A point q will be said to be of non-minimizing type relative to f(p) if there is 
a point p in every neighborhood of qg such that f(p) < f(q). With this under- 
stood we come to a major theorem. 

TuroreM 3.1. If f° admits two disjoint minimizing sets w, and w: and if there 
are points y and z in w, and w, respectively which are connected by an f’-bounded 
set, if moreover Hypotheses I to VI are satisfied, then f° possesses at least one fh- or 
h-critical point of non-minimizing type. 

We refer to the numbers yu, and yw of Theorem 2.1, and prove the following: 

(i). The set o of h- and fh-critical points of f° at the level u is closed. 

This follows from Lemma 3.1. 

(ii). The set o is not empty. 

The h-critical point w, of Lemma 3.2 is a point at which F’(w,) = u,. These 
points w, have at least one cluster point w by virtue of Hypothesis III. It 
follows from Hypothesis VI (a) that f°(w) = u. Thus w is an fh-critical point 
belonging to o, so that o is not empty. 

(ii). There is at least one point of o of non-minimizing type. 

The set ¢, on which f’ = y, fails to contain the points y and z of Theorem 2.1, 
since p exceeds f’(y) and f°(z) as stated in Theorem 2.1. Observe that y, 2, and 
w lic on the same component of f? since y, z, and w, lie on the same component 
of F,,,, and the maximum distance of points of F,, from f; tends to zero as pr 
tends to uw. The set o cannot be a minimizing set, otherwise ¢ would be at a 
positive distance from its complement on fy, contrary to the fact that y, z, 
and w are connected on f; , while y and z are not in ¢ and wisin«¢. Since a is 
compact and not a minimizing set of f°, there exists at least one point of o of 
non-minimizing type. 

The proof of the theorem is complete. 
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4. The application to minimal surface theory. Let g’ be a simple, closed, 
rectifiable curve in a space of coordinates (x%;,---, tn). We suppose g’ has 


the vector form 
x = g((s), 


where s is the are length along g’. The curve g’ is given with a sense and an 
origin s = 0. So given, g will be termed coordinated. It will be convenient 
to suppose that the total length of g’ is 2x. We shall assume that g° satisfies 
the following condition. 

Chord arc hypothesis. Under this hypothesis the ratio of the length of an arbi- 
trary chord of g° to the length of the minimum subtended arc of g° shall be bounded 
from zero for all chords of g’. 

Any simple, closed, regular curve of class C’ satisfies this hypothesis, as does 
a simple closed curve composed of a finite set of regular ares of class C’, provided 
at each corner p the two tangent rays directed from p never make a null angle. 
In particular any simple closed polygon is admissible. This condition is less 
restrictive than the corresponding condition of Shiffman (loc. cit.). For under 
Shiffman’s condition two tangent rays at a corner cannot make angles less than 
a right angle, while it is easy to show that under Shiffman’s condition the chord 
are hypothesis is always satisfied. 

A sequence h” of closed, rectifiable, coordinated curves x = h"(s) will be said 
to converge in length’ to h°: x = h’(s) if 


lim h"(s) = h°(s) 


uniformly on each bounded interval for s. It is clear that there exists a sequence 
of simple, closed, coordinated polygons which converge in length to g’. The 
corners of these polygons can be “rounded off” to obtain a sequence of simple, 
regular, coordinated, closed curves g” of class C’ converging in length to g’. 
Without loss of generality we can suppose that the length of each curve g” is 
27, since if this is not already the case, a magnification from the origin in the 
ratio p, to 1 will bring this about, where p, must be suitably chosen and so chosen 
converges to 1. Let x = g”(s) be the vector representation of g”. 

Let (a) be a continuous non-decreasing function of a such that 


(4.1) g(a + 27) = o(a) + 2. 








° The coérdinated curves h" converge in length to h® if the following three conditions 
are fulfilled. The sensed curve h” converges to the sensed curve h® according to Fréchet, 
the length of h converges to the length of h®, and the point s = 0 on h" converges to the 
points = 0onh®. For related, but variant, definitions of convergence in length, see the 
following three sources. Adams and Lewy. On convergence in length. Duke Mathe- 
matical Journal, vol. 1 (1935), pp. 19-26. McShane. Curve space topologies associated with 
variational problems. Ann. Scuola Norm. Super. Pisa, (2) vol. 9 (1940), pp. 45-60. Morse. 
The calculus of variations in the large. American Mathematical Society Colloquium Pub- 
lications (1934), p. 209. 
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We suppose that g(a) satisfies a three point condition defined as follows. The 
relation a = ¢(a) shall hold for three given distinct values a; , a2 , as, of a on 
the interval 0 < a < 2m, where a, a2, a3 are independent of g. A function 
(a) of this nature will be termed admissible. We shall impose another condition 
on the functions g when we define the space M. 

We admit representations of g” of the form 


(4.2) x = g"le(a)] = p*(a) (n = 0,1,---). 


We shall consider the function g(a) as a point in an abstract metric space in 
which the distance between two points g(a) and ¥(a) shall be the number 


(4.3) ve = max | g(a) — (a) | (0 S a S 2n). 
The function f"(v) shall be defined as the Douglas function® 


(4.4) Ong) es (n = 0,1, --- 
2 





where w denotes the parallelogram 
0 B S 2r, 
B-rsasBt+r. 


The Douglas function is an improper integral with an integrand which is singular 
when a = 8B. 
As in MT we introduce the prem integral 


(4.5) H(¢) = a J/* ee — aa =) da dB. 





It follows from the chord are hypothesis on g’ and the ordinary properties of 
length that there exists a constant « such that 


(4.6) [p’(a) — p(s)’ S [v(a) — o(6)P S xfp'(ae) — ppl, 
where « is independent of the choice of g. From (4.6) we see that 
(4.7) fe) = Hy) S wf’). 

Upon comparing f"(y) with H(y) we find that 

(4.8) f"(e) S Hy) S xf’). 


Moreover g” is regular and of class C’ from which we infer that the ratio of an 
arbitrary chord length of g” to either of the corresponding arc lengths exceeds 





* Douglas. I. Solution of the problem of Plateau. Transactions of the American Mathe- 
matical Society, vol. 33 (1931), pp. 263-321. II. The mapping theorem of Koebe and the 
problem of Plateau. Journal of Mathematics and Physics, vol. 10 (1931), pp. 106-130. 
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some positive constant «,. It follows as in the proof of (4.8) that f° < «,f”. 
From this result and from (4.8) we conclude that f"(¢) is finite if and only if 


f’(¢) is finite. 
We shall restrict the space M of points ¢ to points ¢ for which f(y) is finite. 
We shall prove the following theorem. 
THroREM 4.1. At each point of M, 


(4.9) lim f"(¢) = f(¢). 


Let e be a positive constant less than 7 and let w, be the subset of the parallelo- 
gram w on which |a — B| <e. We set 


w=wteoe, (we-we) = 0, 
and 
(4.10 f"(¢) = F(¢, n, e) + F*(¢, n, e), (n si 0, 1, sa ); 


where F(g, n, e) and F*(y, n, e) are defined as is f"(¢) except that w, and we 
shall replace w as the respective domains of integration. From (4.6) we see that 


(4.11) Fg, n, e) S «F(g, 0, e). 


By hypothesis f°(¢) is convergent so that F(¢, 0, e) is convergent. Noting that 
the integral F*(y, n, e) is proper, we see that 


(4.12) lim F*(g, n, e) = F*(y, 0, e). 


n=o 


To establish the theorem, observe that 
(4.13) l f"(¢) — f(g) | s | F(g, n, €) — F(g,0, e) | - | F*(g, n, e) — F*(g, 0, e) |. 


Corresponding to an arbitrary positive constant 7 let e > 0 be chosen so small 
that | F(g, 0, e) | < ». This is possible since F(¢, 0, e) is a convergent integral. 
Making use of (4.11) we see that 


| F(y, n, e) — Fe, 0, e) | S xn + 2. 
With e so chosen, let be so large that 
| F*(y, n, e) — F*(y, 0, e) | < 2. 
Making use of (4.13), we then find that 
If") -P@&)|Sentata, 


and the theorem follows at once. 





5. Verification of hypotheses. We shall show that the Douglas functions 
f"(¢) defined in the preceding section satisfy Hypotheses I to VI of §§1 to 3. 
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Hyporuesis I. That M is boundedly compact relative to f"(y) follows from 
the work of Douglas II. See also Rado’ V, 17, and Courant.° 

Hypotuesis II. To verify this hypothesis we recall the origin of the Douglas 
function. Let g be a simple, closed, rectifiable, coordinated curve given with 
a vector representation x = g(s) in terms of arc length s. Let g(a) be an ad- 
missible function g. Corresponding to the representation x = p(a) = gl¢(a)] 
there exists a harmonic surface x = x(u, v) defined and continuous for uty<) 
and such that 


x (cos 6, sin 0) = p(@), 


where r and @ are polar coordinates in the (u, v)-plane. We shall say that the 
harmonic surface x = x(u, v) is defined by the pair (g, ¢). 
The Douglas function f(g) corresponding to g equals the Dirichlet sum 


1 éx\’ ax\’ 
Dg, ¢) = yA |(2) + (2) eta, 


where R represents the region u’ + vo < 1. 

Let v" be a sequence of points on M converging to ¢’. Recall that g” con- 
verges in length to g’. Hypothesis IT is satisfied if 
(5.1) inf lim D(g", ¢") = D(g’, ¢’). 

Relation (5.1) is a consequence of the lower semi-continuity of D(g, ¢) of 
which a conventional proof may be indicated as follows. Let D(g, ¢, 7) denote 
the integral obtained from D(g, ¢) upon replacing R by the region uw? + 0° < 
r <1. By definition 


Dg, ¢) = lim Dg, ¢, 7). 


The lower semi-continuity of D(g, ¢) follows from the fact that D(g, ¢, 7) is con- 
tinuous in (g, ¢), positive and non-decreasing in r. 
Relation (5.1) holds, and Hypothesis IT follows. 


Hypotuesis II]. Let g(a) be an infinite sequence of points g on M such 
that 


f"(¢') Se (mn <n <---;r=1,2,.---) 


for some constant c and suitable choices of the integers n,. To establish III 
we must prove that some subsequence of the sequence g’ converges to a point 


on M. 
According to the theory of bounded monotone functions there exists a sub- 





7Radé. On the problem of Plateau. Ergebnisse der Mathematik und ihrer Grenzge- 
biete, Berlin (1933). 

§ Courant. Plateau’s problem and Dirichlet’s principle. Annals of Mathematics, vol. 
38 (1937), pp. 679-724. 
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sequence of the functions g’ which converges for each value of a to a non- 
decreasing function 6(a). The function 0(a) satisfies the three point condition 


and the condition 
O(a + 2m) = O(a) + 2m. 


It remains to show that @(a) is continuous. 

Whether continuous or not @(a) possesses right and left limits a; and az at 
each point a9. Moreover | a; — a2| # 22, since 0(a) satisfies the three point 
condition. If |a, — a2| # 0, it follows as in the proof of Theorem III of 
Douglas II, page 113, that 

sup lim D@g”’, ¢’) = ~. 


But 
f""(¢’) aa Dig", ¢) < c. 


From this contradiction we infer that a; = a2 so that @(a) is continuous. Thus 
6(a) defines a point of M, and Hypothesis III is verified. 

Hypotuesis IV. This hypothesis is an immediate consequence of (4.8). 

Hypotruesis V. The functions f"(g) are weakly upper-reducible for n > 0 
in accordance with Theorem 5.1 of MT. 

The proof of Hypothesis VI is more involved. 

Hyporuesis VI. Let g be a simple, closed, coordinated, rectifiable curve, 
and g(a) a transformation which is admissible in the sense of §4. Let A(g, ¢) 
be the area of the harmonic surface defined by the pair (g,¢). In a forthcoming 
paper we shall prove the following theorem of use in establishing VI. 

(i) The area A(g, ¢) is continuous in its arguments. 

The notion of continuity of A(g, ¢) is relative to the concept of convergence 
of g and y. Convergence of curves g shall be convergence in length, and con- 
vergence of y, convergence defined by the distance function gy of (4.3). 

A pair (g, g) which defines a minimal surface will be called differentially critical. 
When (g, ¢) is differentially critical, 


(5.2) Dg, ¢) = AG, ¢), 


as is well-known. Cf. Douglas II, Theorem I. When ¢ is an h-critical point 
of the function f(¢) = D(g, ¢), (g, ¢) will be termed h-critical. That an h-critical 
pair is a differentially critical pair follows from Theorem 6.2 of MT. Hence 
(5.2) holds for h-critical pairs. From (i) we draw the following conclusion. 

(ii). The function defined by D(g, ¢) on the subset H of h-critical pairs (g, ¢) 
is continuous on H. 

Hypothesis VI (b) is satisfied by virtue of (ii). To verify Hypothesis VI (a) 
Wwe return to the sequence f"(¢), and prove the following lemma. 

Lemma 5.1. If ~° is an fh-critical point of f°, the harmonic surface defined by 
(9°, ¥°) is minimal. 
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By definition of an fh-critical point of f°, v’ is the limit of a sequence of h-criti- 
cal points y’ belonging respectively to a subsequence f”’ of the function f". Let 
S’: x = x'(u, v) (r = 1,2, ..-.) 
be the minimal surface defined by (g”’, y’), and let 
S°: x = x"(u, v) 
be the harmonic surface defined by (g’, y’). The harmonic surface S° will be 
minimal if its differential coefficients satisfy the conditions 
(5.3) E(u, v) = G(u, v), F(u, v) = 0 (ue + <1). 


Let E’, F’, G’ denote the corresponding differential coefficients of S’. Re- 
call that 


lim (g"’, ’) = (9°, y’). 


Upon representing x'(w, v) by means of the Poisson integral, one finds that E’, 
F’, G’ converge to E, F, G as r becomes infinite. But S’ is minimal, so that 
conditions of the form (5.3) hold for 8’. It follows that (5.3) holds for S°, so 
that S° is minimal. 

The proof of the lemma is complete. 

By virtue of the lemma, (5.2) holds on the set of fh-critical points of f°. Hy- 
pothesis VI (a) is satisfied by virtue of (i). 

Our principal theorem is as follows. 

TuroreM 5.1. Let g° be a simple, closed, rectifiable curve g° which satisfies the 
chord arc hypothesis. If g’ bounds two minimal surfaces of disc type defined re- 
spectively by points on disjoint minimizing sets of the Douglas function f(y), then 
g also bounds a minimal surface of disc type but not of minimum type. 

It follows from the hypotheses of the theorem that f°(v) possesses two disjoint 
minimizing subsets w; and w,. Let ¢ and y be points on w; and we respectively. 
The 1-parameter family of points 


ty(a) + (1 — bea), (0 <¢t < 1), 


is an f’-bounded are connecting ¢g with y as has been shown in MT, page 451. 
The first hypothesis of Theorem 3.1 is accordingly satisfied. 

Hypotheses I to VI hold for the sequence f"(¢) as we have seen. Theorem 3.1 
thus applies, and we infer the existence of at least one fh- or h-critical point 
v° of f° not of minimum type. But we have just seen that such a point y” 
defines a minimal surface of disc type bounded by g’; this surface is not of mini- 
mum type relative to the Douglas function f”. 

The proof of the theorem is complete. 
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. ON A LEMMA OF McSHANE' 
By Trsor Rapé 
(Received January 4, 1940) 


1. McShane, in his work on the lower semi-continuity of double integrals in 
Calculus of Variations,” established the following important lemma.*® In the 
unit square Q: 0 S u S$ 1,0 Sv S 1, let there be given triples of continuous 
functions x(u, v), y(u, v), 2(u, v), tn(U, V), Yn(U, Vv), Zn(u, v) such that the following 
conditions are satisfied. 

I. The partial derivatives 2, , 2, Yu, Yo, Zu, 2% exist almost everywhere in Q. 

II. The Jacobians 


X = Yulv — Yoru; Y = 2p — 2yTu, Z = Xo — TWYu 


are summable in Q. 
III. The functions x(u, v), y(u, v), 2(u, v) are absolutely continuous on the 
boundary B of Q. 
IV. We have 
1 


J] Xéem = si (ydz — zdy), J] Y dudv = sf (zdx — xdz), 


[[ Zaua = sf Gay — ydz). 
Q 


V. The functions z,(u, v), yn(u, v), 2n(u, v) are quasi-linear* in Q. 

VI. z,(u, v) > x(u, v), yn(u, v) > y(u, v), Zn(u, v) — 2(u, v) uniformly in Q. 

Then for every choice of the constants a, b, c there exists a sequence {V,} of meas- 
urable subsets of Q such that 


| [ ex, + oY. + fy) dude sae [fox + vY + cf) dude, 
Va Q 


where 
X,, _ Ynuk nv oe Ynv@nu 5 ro = Z2nutlnv — Znv0nu » Zn — LnuYnv a LnvY nu . 


2. In view of the importance of this lemma, it seemed to be of interest to 
investigate the possibility of replacing the conditions I-IV of McShane, con- 





* Presented to the American Mathematical Society at the meeting in Chicago, April 1940. 

*E. J. McShane, Integrals over surfaces in parametric form, Annals of Mathematics, 
vol. 34, 1988, pp. 815-838. 

* Loe. cit.,? p. 829. 

*A continuous function f(u, v) is quasi-linear in Q if Q can be subdivided into a finite 
number of triangles on each of which f(u, v) is linear. 


73 











74 TIBOR RADO 


cerned with the limit triple z(u, v), y(u, v), z(u, v), by less restrictive ones. It 
should be noted that the purpose of conditions III and IV is to provide, for 
the integral of aX + bY + cZ, a geometrical interpretation which is essential 
for the proof of McShane. It may be therefore of interest to point out that 
the lemma remains valid if the apparently decisive conditions III and IV are dropped. 
The purpose of this paper is to prove this assertion. Applications of the result 
will be considered on another occasion. 

3. As stated above, the lemma involves three arbitrary constants a, b, c, 
but it is evident from the work of McShane that it is sufficient to establish the 
apparently very special case a = b = 0, c = 1, since the general case can then 
be disposed of by a very simple device. For this reason, and also for easier 
reference, we wish to state our result for this special case explicitly. 

4. THeorEM. In the unit squareQ: 0 S u £ 1,0 Sv S 1, let there be given 
pairs of functions x(u, v), y(u, v), tr(u, Vv), Yn(u, v) such that the following condi- 
tions are satisfied. 

I. The partial derivatives Xu , Xv , Yu» Yr exist almost everywhere in Q. 

II. The Jacobian J = tuYo — LeYy 18s summable in Q. 

III. The functions x,(u, v), Yn(u, v) are quasi-linear in Q. 

IV. xn(u, v) > x(u, v), yn(u, v) > y(u, v) uniformly in Q. 

Then there exists a sequence of measurable subsets V,, of Q such that 


[[ teaudo— ff sauce, 


Vn Q 
where Jn = InuYav — TavYfau - 

5. Applying a device due to McShane,’ we proceed as follows to derive from 
this statement the one described in section 2. Let us consider the triples x(u, v), 
y(u, v), 2(u, v), La(U, V), Yn(U, v), 2n(u, v), but let us assume only that conditions 
I, II, V, VI of section 1 are satisfied. Let there be given a set of constants 
a,b,c. Without loss of generality we can assume that a? + b? + c? = 1. We 
can find then an orthogonal matrix, with determinant +1, of the form 


Qi Ay Arg 
Goi Ae2 A 
a b Cc 
Let us put 


E(u, v) = ayx(u, v) + aywy(u, v) + aygz(u, v), 
E,(u, v) = antn(u, v) + ayyn(u, v) + ass2n(u, 2), 
(u,v) = anx(u, v) + aoey(u, v) + aese(u, v), 


Gn(U, Vv) = Anxcu, v) + aeeyn(u, v) + Gegzn(U, v). 


Let J, J, denote the Jacobians of the pairs 2, gand Z, ,g, respectively. Clearly, 
the pairs £, 7, Z, , J, satisfy the conditions of section 4 and therefore we have, 





5 See loc. cit.3 
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by the theorem stated in that section, a sequence {V,} of measurable subsets 


of Q such that 
[J Jndudo— [ [ Jando. 


Van Q 


This sequence possesses the desired property, since we have by direct 
computation 


J =aX + bY + cZ, J, = aX, + bY, + cZ,. 

6. The rest of this paper contains the proof of the theorem of section 4. 
Sections 7-12 contain some facts needed in the sequel, while sections 13-17 
contain the proof itself. 

7. Let G be a bounded measurable set in the zy plane and h(z, y), Aa(x, y) 
summable functions on G such that h,(z, y) — h(a, y) on G. We assume that 
these functions do not take on the values +«. Under these conditions, there 
exists a sequence of measurable subsets G, of G such that 


[/ hadedy — [ [ hardy. 
Gn G 


Proor. Define 


Kn = gr.l.b. 


Jf raed _ Jf heady 


where the greatest lower bound is taken with respect to all measurable subsets 
E of G. Give any e > 0. Since h(z, y) is summable on G, we have then an 
n = n(e) > Osuch that if S is any measurable subset of G we have 


[ f nas ay 


8 





<e if |S|<7n° 








Since h, — h on G, we have by the theorem of Egoroff’ a measurable subset H 
of G such that 


|G-H|<y 
and such that h, — h uniformly on H. Hence 


[fin -nlardy 0 for n— @, 
a 


and consequently 
0<limn <<, 


no 








‘ If S is a measurable set, then | S | denotes its Lebesgue measure. 
’ See for instance Saks, Theory of the integral, Warszawa 1937, p. 18. 














TIBOR RADO 


Jf raed — lJ had 
JJ nazdy|+ JJ 18—m nlaedy <e+ [hh | dedy. 


G—H 





As € > 0 was arbitrary, it follows that x, > 0. By the definition of x, , we have 
a measurable subset G, of G such that 


[[ ravay — If h,dxdy| < mt. 


G Gn 


As k, — 0, the sequence G, has therefore the desired property. 

8. Suppose the functions f(u, v), g(u, v) are merely continuous on the unit 
square Q:0 Su51,0 5021. Let (uw, v) be an interior point of Q and q 
a square in Q with center at (uo , vo) and sides parallel to the axes. Denote by 
b the boundary of g. Take four constants a, 8, y, 6 and put 


_ |f(u, v) — f(uo, %) — a(u — uw) — By — vo) | 
p(uo, %, 9, f, a, B) = hepa [(u — wo)? + (v — wo)?}? : 








as | g(u, v) — g(wo, vo) — y(u — uo) — 8(v — 0) | 
aa ae a [(u— w+ @— 9) 


If there exists some sequence of squares g, such that simultaneously 


p(Uo, 0, Qn, f, a, B) > 0, p(Uo , V0, In, 9; Y, 5) — 0, |qn| 0, (1) 


then we shall say that the pair f(u, v), g(u, v) satisfies condition C at the point 
(uo , Uo) with the constants a, B, y, 6 

9. The reader will have noticed that the preceding condition C is merely a 
greatly weakened form of total differentiability. As it might be expected, the 
methods developed by Rademacher, Stepanoff and others to obtain results of 
surprising generality concerning total differentiability,” can be easily adapted 
to our condition C. One statement obtained in this manner is as follows.” 
Suppose that the functions f(u, v), g(u, v) are continuous in Q and that the 
partial derivatives f, , f. , gu, gv exist almost everywhere in Q. Then condition 
C is satisfied at almost every point (uo , v0) of Q with the constants a = f,(uo, 
vo); B= fo(uo , vo), 1. = Gu(Uo , vo), 6 = GJo(Uo , Uo). 

10. Consider again a pair of functions f(u, v), g(u, v) which we assume to be 
merely continuous in Q. These functions define a continuous transformation 





§ See the excellent presentation loc. cit.,7 Ch. 9. 
* For the details of the proof, see the author’s paper, On absolutely continuous transforma- 
tions in the plane, Duke Math. Journal, vol. 4, 1938, pp. 189-221, in particular p. 219. 








ON A LEMMA OF MCSHANE ‘4 


x = fu, v) 


y = g(u, v) inline 


Let q be a square in Q, with sides parallel to the axes, and let b be the boundary 
of g. If the point (u, v) describes 6 in the counter-clockwise sense, its image 
under 7' describes in the xy-plane a continuous closed oriented curve 6’. In the 
y-plane we define a function y(z, y, g, T) as follows. If the point (z, y) is on 
b’, then u(z, y, g, T) = 0. Otherwise u(x, y, g, 7’) is equal to the topological 
index of (x, y) with respect to b’. Weshall denote by S,(q, 7), S2(q, 7’) the point- 
sets in the zy-plane where u(x, y, g, T) is equal to +1 and —1 respectively. 
These sets are bounded and open (possibly vacuous). 

11. Keeping the notations of the preceding section, let us consider an interior 
point (uw, vo) of Q where the condition C of section 8 is satisfied with some 
constants a, B, y, 6. Let {qn} be a sequence of squares, as specified in section 8, 
such that the relations (1) hold. We consider the auxiliary transformation 


x = f(uo, vo) + alu — uw) + B(v — w»), 
y = g(uo, vo) + y(u — uw) + d(v — ), 


Assuming that ad — By ¥ 0, the image of the square gq, under 7” is a parallelo- 
gram +, in the zy-plane. Suppose first that ad — By > 0. Then the set 
Si(qn , T*) is simply the interior of x2 , and condition C implies, by a familiar 
elementary reasoning,” that for large values of n the symmetric difference” of 
the sets Si(gn , T*) and S:(q,, 7’) is contained in a narrow strip ony bounded 
by two parallelograms similar to x,, such that |¢,|/|a,|—0. As 
| ra |/| qn | + a5 — By, we also have | on |/|q.|— 0. Thus a fortiori 


| Sign, T) | we | Si(qn, Sar i 
| Qn | 


But, since 7* is an affine transformation, 


| Sign, T*)| _ lan 


(u, v) €Q. 








— ai — By. 
| Qn | | Qn | 
Hence 
Sides 2 | —ai— By if ab — By>0. 
Similarly it follows that 
— Bites Tob — Bry if ad — By <0. 





1° For a detailed discussion of a practically identical situation, see the author’s paper, 
Uber das Fléchenmass rektifizierbarer Flaéchen, Math. Annalen, vol. 100, 1928, pp. 445-479, 
in particular pp. 461-466. 

The symmetric difference of two sets A and B consists of those points which belong 
to exactly one of A, B. 
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12. Using the notations of section 10, let us assume that f(u, v), g(u, v) are 
quasi-linear in Q. Let us denote by J(u, v) the Jacobian fug. — figu. Let us 
take, in the zy-plane, any summable function F(z, y) which takes on only 
finite values (summability meaning that F(z, y) is summable on every bounded 
measurable set). Then we have for every square q in Q the transformation 
formula 


[f Firs»), o(u, I, ») dude = | f Fee, Wale, yg 1) de dy.” 


As a matter of fact, this formula holds under very general conditions.” For 
quasi-linear transformations, of course, the formula is practically trivial. 

13. We proceed presently to prove the theorem of section 4. Through the 
rest of the paper the notations of that section will be used. Let q be a square 
in Q with sides parallel to the axes. We define 


An(q) = gr.l.b. [[ rad - [{ J,dudv|; 
eco |e “ 


that is, the greatest lower bound is taken with respect to all measurable subsets 
E of gq. We define further 


dq) = lim An(q). 


no 


By taking in q a set E of measure zero, we see that 


An(q) S | [ saa! < [fi ziauae, 


and hence also 


(qq) S [fis iaude. 


Take now in Q any finite or infinite sequence of non-overlapping squares 
M1, 42, -*+,@Q;,---, Such that 


lQ-— diai| = 0. 


We assert that 
(Q) S D> (qi). 


# As u(z, y, g, 7) vanishes outside of a sufficiently large circle, the range of integration 
on the right can be taken as the whole zy-plane. 

43 See, also for further literature, loc. cit.2 It is worth noting that the quasi-linear 
character of f(u, v), g(u, v) is used only to secure the above transformation formula. This 
remark suggests further generalizations which we do not wish to discuss at this time. 
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Proor. Observe first that the series on the right is convergent since it is 
dominated by the series 


X ff iy iauas 


which converges because the squares g; do not overlap. Givenowe > 0. By 
the definition of \,(q;), we have a measurable subset EF’, of q; such that 








[f tava _ If J,dudv| < Xn(q)) + >" (3) 
Put a * 
E, = LEs. 


Since the squares g; do not overlap, we have by (3) 


[fraud _ [[ teaudv 
@ En 


<z 


AQ) S 














aj 


[[ sandy — [/ Laud < 2 dali) +e (4) 


Take any positive integer 7. We have then by (2) 
y An(G;) S > An(Qi) + hte [/ | J | du dv. 
7 | ao =i 


qj 
Hence 


lim Dra) S Dra) + Df f | laude 


Qj 


Since the squares q; do not overlap, the second summation on the right con- 
verges to zero fori — ©. Thus 


lim D> Aa(g;) SD A). 


n—o 7 


From (4) we infer now 


XQ) = lim 4,(Q) $ HAG) + 


arn . 


Since ¢ was arbitrary, this proves our assertion. 
14. We consider now the transformation 


Le} = x(u, v), 


(u, v) €Q. 
ly i y(u, v), 





* We discuss the case of an infinite sequence q;, since the finite case is trivial. 
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12. Using the notations of section 10, let us assume that f(u, v), g(u, v) are 
quasi-linear in Q. Let us denote by J(u, v) the Jacobian fugo — frgu. Let us 
take, in the zy-plane, any summable function F(z, y) which takes on only 
finite values (summability meaning that F(z, y) is summable on every bounded 
measurable set). Then we have for every square g in Q the transformation 
formula 


[[ Ptr, v), ou, o1ICu, ») dudo = | f Fe, Wule, v, a, 7) dx dy.” 


As a matter of fact, this formula holds under very general conditions.” For 
quasi-linear transformations, of course, the formula is practically trivial. 

13. We proceed presently to prove the theorem of section 4. Through the 
rest of the paper the notations of that section will be used. Let q be a square 
in Q with sides parallel to the axes. We define 


An(qg) = gr.l.b. [f tauay = [J J,dudv}; 
ECa 
qa E 
that is, the greatest lower bound is taken with respect to all measurable subsets 
E of q. We define further 
d(q) = lim A,(q). 


noo 


By taking in q a set EF of measure zero, we see that 


An(q) S | [ sauao| < [fir idude, 


and hence also 


na) sf f |s\duae. 


Take now in Q any finite or infinite sequence of non-overlapping squares 
M1, 42,-*:,Q;,---, Such that 


1Q- do a;| = 0. 
We assert that 


\(Q) = >> Aq). 


As (2, y, g, T) vanishes outside of a sufficiently large circle, the range of integration 
on the right can be taken as the whole zy-plane. 

8 See, also for further literature, loc. cit. It is worth noting that the quasi-linear 
character of f(u, v), g(u, v) is used only to secure the above transformation formula. This 
remark suggests further generalizations which we do not wish to discuss at this time. 
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Proor. Observe first that the series on the right is convergent since it is 
dominated by the series 


x ff is taud, 


which converges because the squares g; do not overlap. Give nowe > 0. By 
the definition of \,(q;), we have a measurable subset E?, of q; such that 





[f sauao — J J r-avas| < Malai) + &. (3) 
Put “ * 
E, = 2 En. 


Since the squares g; do not overlap, we have by (3) 


[J sam oF 
[fraud — ff trauao 


qj Ei. 


\n(Q) 








<> < 2 rag) + € (4) 








Take any positive integer 7. We have then by (2) 


Dd An(Qi) Ss > ra(q + , lf | J | du dv. 

7 7=1 7=i+1 4 

Hence . 
lim De An(qi) Ss D A(qi) + >» [/ | J | du dv. 
no 7 7=1 7=it+1 


qj 


Since the squares g; do not overlap, the second summation on the right con- 
verges to zero fori — ©. Thus 


lim > a.) S XQ). 


no 7 


From (4) we infer now 


XQ) = lim d.(Q) S$ AG) + 


n-?o e 


Since ¢ was arbitrary, this proves our assertion. 
14. We consider now the transformation 





- ig ae (u, v) €Q. 


ly _ y(u, v), 





* We discuss the case of an infinite sequence q; , since the finite case is trivial. 
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15 


Let g be a square in Q with sides parallel to the axes. We assert the inequality 


x@) 5 min | | { ronan - |S(q, 7) |, Jf sande + ista.7\], (5) 


qa 


where the sets S,(q, 7’), Se(q, 7’) are defined as in section 10. 
Proor. Let us consider the transformations 
t= 2,(u, 2), 
T..: | (u, v) €Q. 
Ly = yalu, »), 
In the zy-plane, let S be the set where u(z, y, g, T) # 0. Since z,(u, v) > 
r(u, v), Yn(u, v) > y(u, v) uniformly in Q, and hence in q, we have 


u(x, Y, q, Tn) as u(2, Y, 4; T) 

on S, and hence also on the sets Si(q, 7), Se(q, 7). Since 7’, is quasi-linear, 
u(x, y, g, T'n) is summable. We cannot assert that u(x, y, g, 7) is summable, 
but this function is surely summable on S,(q, 7) and S2(q, 7), since | u(z, y, 
q, T’)| = 1 on these sets. Hence we can apply the remark in section 7 with 
h(x, y) = w(x, y, 9, T), ha(x, y) = wz, y, Q, Tn) and first with G = S,(q, 7) 
and second with G = S.(q, 7’). It follows that we have sequences of bounded 
measurable sets Gi, , G, such that 


I u(z, y, 7, Tn) dx dy > II u(x, y, 9, T) dxdy = | Si(q, T) |, (6) 


Gc; 81(¢,T) 


If u(x, ¥; 4; Tn) dx dy —_ If u(x, Y, 9; T) dx dy = | S2(q, T) |. 

rig S2(q,T) 
Now let E;, be the complete model” in q of the set Gj, under the transformation 
T,. The transformation formula of section 12 yields then, if F(z, y) is taken 
as the characteristic function of the set Gi, , the formula 


IJ u(x, y, 9, Tn) dx dy = II J, dudbv. (7) 


, 
Gn En 


Similarly, if EZ, is the complete model of G’, in q under the transformation 7, , 


we have 
[f we, y, 9, T,) dxdy = [/ J,dudv. 
en’ . ee 


Now, by the definition of \,(q), 


r»(q) S [f taudv ~ [[ teduav), 


q E,, 





18 If a, b are real numbers, then min (a, b) denotes the smaller one of a, bifa ~ b and 
the common value of a, b if a = b. 


16 That is, E, is the set of all those points in g whose image under T’, is comprised in G;. 
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and for n — © it follows by (6), (7) that 
[[ aud — |S.(q, |). 


qd 


Aq) S 





Similarly 


@ <| [fraud + | 5.0 |). 


q 





The last two inequalities imply (5). 

15. Denote now by E the set of those interior points (uo , v%) of Q where the 
following conditions are satisfied. 

I. tu, Zo, Yu y Yo exist at (Uo, %). 

II. Condition C of section 8 is satisfied at (wo , v9) with the constants a = 


Lu(Uo , Vo), B = Ty(Uo ; V0); = Yu(Uo ’ v0); 5= Yo(Uo ; Uo). 
III. If gis a square with sides parallel to the axes and with center at (uw, vo), 


then 
/ / Sduie 


+———— J(u, %) for |gq|— 09. 
la| 
Since J(u, v) exists almost everywhere in Q and is summable there by assump- 
tion, III is satisfied almost everywhere in Q by a well-known theorem, while 
II is satisfied almost everywhere in Q by section 9. Thus 


|Q-— E| =0. 


16. Now let (wo, vo) be a point of the set EZ of the preceding section. By 
condition II, we have then a sequence gn(wo , v0) of squares with sides parallel 
to the axes and with center at (w , v) for which the relations (1) of section 8 
hold with f(u, v) = x(u, v), g(u, v) = y(u, v), a = tu(uo, Vo), B = Ly(Uo, r%), 
Y = Yu(Uo, V0), 5 = Yr(Uo, Vo). By section 11 we have therefore 


| Si(qn(ro, vo), T) | 
Qn(Uo, v0) | 





—> J(uo, vo) if J(uo, %») > 9, 


and 


_|S2(¢a(wo, v0), T)| _, To, %) if J(u, %) <0. 
| Qn(Uo , vo) | 





But, by condition ITI of section 15, we also have 


[/ J dud 


dn(%9,%0) _ J(w, Up). 
| Qn(Uo, vo) | 
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lf J dudv — | Si(qn(uo, v0), T) | 


Gn(%o,v0) —0 if J(u, vo) > 0, (8) 
| qn(wo, Vo) | 





[ff sauce + | Sx(¢n(uo, v0), 7)| 


Gn(uovvo) —0 if J(u, um) <0. 
| qn(tuo , Yo) | 


By (5) in section 14 the relations (8), (9) imply that 





A(gn(uo, vo)) +0 
| Qn(Uo, v0) | 


if J(uo , 0) # 0. If J(u, v0) = 0, then we infer by section 13 and by condition 


III of section 15 that 
/ / J dudv 


O< A(qn(uo , Uo)) < Gn(%9,¥9) ~ | J(uo, v») | oo @ 
sae | Qn(Uo, vo) | ~ | Gn(uo , vo) | 





17. Thus we see that for every point (uo , vo) of the set EZ of section 15 there 
exists a sequence of squares ¢,(uo , ¥0), With sides parallel to the axes and with 
center at (uo, ), such that 


An (uo ? Uo)) 


eae pee d 7 ; 0. 
| @n(to, Vo) | an | dn(uo, Yo) | 


Besides, all the squares g,(uo , Yo) are comprised in Q. Givenowe > 0. If we 
discard, for each point (uo, vo) of EZ, a finite (sufficiently large) number of the 
squares gp(Uo , Vo), the remaining ones will satisfy the relations 


A(Qn(Uo , V0)) < €|Qn(uo,%) |, | @n(o,%)|—70, gn(uo, %) CQ. 


The squares gn(uo, vo) cover the set # in the manner required by the Vitali 
covering theorem. We can select therefore from amongst the squares gn(Uo , Vo) 
a sequence of non-overlapping squares qi , 92, --- , @;, --- which cover # with 
the possible exception of a set of measure zero. Since 


DacQ and |Q-E|=0, 


we also have 


1Q- DLia;| =0. 


Hence by section 13 
(Q) $ Dag) se LDiai| =«/Ql =« 
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As e« was arbitrary, it follows that \(Q) = 0. By section 13 this implies that 
An(Q) — 0. (10) 


By the definition of \,(Q), there exists a measurable subset V, of Q such that 


[fraud _ [| J, du dv 
, Q Va ‘ 
(10) and (11) imply that 


[[ teduao— ff sande, 


Vn Q 


< »(Q) ++. 
n 





and thus the theorem of section 4 is proved. 
18. As far as applications are concerned, it seems that in some cases it would 
be sufficient to know that we have a sequence V, such that 


lim ff J,dudv = [f raud. 
Va Q 


It is however unlikely that this relation can be derived under assumptions 
weaker than those needed to obtain the precise conclusion of the theorem of 
section 4. . 


THe Onto State UNIVERSITY. 
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ON FINSLER AND CARTAN GEOMETRIES. III 
TWO-DIMENSIONAL FINSLER SPACES WITH RECTILINEAR EXTREMALS! 


By L. BERWALD 
(Received August 26, 1939) 


Introduction. The principal part of the present paper is devoted to the 
problem, proposed by P. Funk,’ of characterizing, in an invariant manner, the 
two-dimensional Finsler spaces the extremals of which can be given, in a suitable 
coordinate system, by linear equations. We call such spaces Finsler spaces with 
rectilinear extremals. 

In an introductory section, we explain briefly, from the beginning, the theory 
of the two-dimensional Finsler spaces developed especially by the author’ and 
by E. Cartan. The standpoint of this exposition is predominantly formal. 
Our aim is to develop Cartan’s theory of the two-dimensional Finsler spaces 
independently of the general theory of equivalence, and to connect it with his 
later theory of the n-dimensional Finsler spaces.’ The single new feature in 
this section is the connection between the tensor FG}, and the main scalar of a 
two-dimensional Finsler space, given in §8. 

Section II develops two different methods which lead to an invariant charac- 
terization of the two-dimensional Finsler spaces with rectilinear extremals. 
The first is purely analytical, and is based upon the discussion of the conditions 
of integrability of a certain system of partial differential equations (§§9-11). 
The scope of the second method is first to establish necessary and sufficient 
conditions in order that a two-dimensional general geometry of paths may have 
rectilinear paths,° and then to apply them to a Finsler space. This method has 
the advantage of showing what is the independent significance of each of the 
two conditions we obtain (§§12, 13). 

In the third section we determine all two-dimensional Finsler spaces with 
rectilinear extremals the main scalar of which is a function of position only. 
First we establish some theorems showing that for such a space the main scalar 





1 The first two papers of this series are L. Berwald, [2], [3]. (Numbers in cornered 
brackets refer to the bibliography at the end of the paper). 

2 See P. Funk, [11]. 

3L. Berwald, [1]. 

4 E. Cartan, [5]. 

5 E. Cartan, [6]. 

° For an n-dimensional general geometry of paths (n > 2), J. Douglas, [7], established 
the corresponding conditions. When n = 2, only one of the two conditions for rectilinear 
paths results from Douglas’ corresponding condition by particularization, the other does 
not. The reason is that, when n = 2, the generalized Weyl projective curvature tensor 
(H. Weyl, [17]; J. Douglas, [7]) vanishes identically. 
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is always, and the curvature nearly always, a constant (being necessarily null 
if the main scalar does not vanish). There exists but one exception: the case 
in which the main scalar has the value +3/+/2 (§15). This exceptional case 
is studied in §§16, 17. In toto, there are six types of two-dimensional Finsler 
spaces of the desired kind. Four of these types depend on arbitrary constants, 
a fifth on an arbitrary function of a single variable (§18). Finally we determine 
all Landsberg spaces with rectilinear extremals (§19). 


I. TwWo-DIMENSIONAL FINSLER SPACES 


1. The metric. We start with an n-dimensional manifold with coordinates 
zr’ in which a variational problem with the fundamental integral 


t t t 
(1.1) s= / F(a’, 2’, «++, 2°32", 2", ---, 2" dt = / F(a, x’) dt, (2 on OF 
ty ty dt 


isgiven. Wesuppose that F is analytic in a certain region % of its 2n arguments 
to which we restrict ourselves, and that 8 contains no points with x” = 





zg” =... = 2" = 0. Further F is supposed to be positive and positively 
homogeneous of the first degree in x’. Finally, we suppose F; > 0, where 
oF | oF 
w 1 ax'* dx! | da’ 
dx’ 


The second factor on the right of (1.2) stands for the determinant of an (n + 1)- 
rowed matrix. A manifold of the considered kind is called an n-dimensional 
Finsler space,’ and F its fundamental function. 

We interpret s as‘arc-length of the curve z* = z‘(t). As element of space we 
regard the oriented line-element (x, x’), that is, a point (x) and a direction 
(px’; p > 0) issuing from it. The quantities with which we have to deal 
(tensors, densities, and so on) depend exclusively on the line-element, i.e. they 
are positively homogeneous of degree zero in x’. 

The metric of a Finsler space is based upon the symmetric covariant tensor of 
the second order 





0 nie) =} 20, 
On account of 

(1.4) g = det. (gu) = F""'F, > 0, 

an n-dimensional euclidean metric is associated with each line-element (z, 2’) by 
(1.5) do” = gix(x, x’) dx‘ dz‘ * 





7 P. Finsler, [8]. 
* Repeated indices indicate summation. 
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This metric is particularly used for measuring the length of vectors. According 
to the homogeneity of F we have 





| (1.6) guz 2" = F, 
, oF 
; Fy ‘ = P . 
; (1 7) Jin L ar’ 


) Therefore orthogonality to the line-element (z, x’) in the metric (1.5) is identical 
; with transversality to the line-element. 
| In consequence of (1.4), the tensor g™ conjugate to gx exists. By means of 










gi, and g™ the lowering and raising of indices is defined in the usual manner. 
We can therefore speak of covariant, contravariant, mixed components of a 
tensor. 


The unit vector of the line-element (z, x’) has the contravariant components 
7 dx‘ x" 

1.8 [f= — =, 
a8) ds F 


and the covariant components 


oF 


(1.9) l; — ax’i’ 








Inner multiplication with the unit vector (l) is indicated by a zero, the position 
of which (above or below) does not matter; for instance 


; (1.10) Tx = Tht =Th, -— T® = T* = TP. 










2. Extremals and parallel displacement. The derivative ,. The extremals 
of the variational problem 6s = 0 have the differential equations 













(2.1) fh. A... 


With regard to F («, *) = 1 and to the homogeneity of F, (2.1) can be written 


as 

d’ x’ :( dz 
2.2) eel r panel Ie 
( ds? += («, *) , 
or 








dl’ G'(x, x’) 
2.3 a — = 
_ ds bins F°(a, x’) 0, 
- where 
7 2772 
¥ 2.4 G yy a BAe cs) (F’) m a(F’) 
By | (2.4) (z, 2) = 4g E xm” dx | 


G' is therefore positively homogeneous of the second degree in 2’. 












det 
of 

Ri 
spi 


(2. 
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We indicate by ®, the derivative of any function (z, x’) with respect to the 


are s of the extremal issuing from the line-element (z, x’). Because of od = 0 


and of the homogeneity of G ‘ we have 


(2.5) @, = bil’ = dq, 
where 

ob 0b OG” 
(2.6) 9.5) — oxi _ ax’ art’ 


If @ does not depend on 2’, #:;) reduces to the partial derivative . For a 


scalar &, &;) are the covariant components of a vector. If ® is a tensor which 


is not a scalar, the differentiation --- ;, does not generate a tensor. In par- 
ticular we have 
(2.7) F 0. 


To introduce a covariant differentiation for other quantities than scalars, we 
define the parallel displacement of line-elements and the parallel displacement 
of quantities from one line-element to another parallel. By analogy with the 
Riemann space, it is natural to take as coefficients of a connection in the Finsler 
space the functions 


ri = Tit = 9" Tin, 
(2.8) where 
(Tins = FGinay + gin — gi). 
From (1.3) and (2.4) it follows that 


ek opi ac 
(2.9) liz" = aah 


The parallel displacement of the line-element (z, x’) to the point (x + dz) is 
now defined by 


*k pip ‘i 
li; 22" = 20°. 


. OF as) = 0, 


2.10 ‘4 Prt de = dl += 
(2.10) d+ PTs; de’ = dl + 5 oO , 


and the parallel displacement of an arbitrary vector X' from its line-element 
(x, x’) to the line-element obtained from (x, x’) by parallel displacement to the 
point (x + dx) by 

(2.11) dX‘ + X"ryxi dx’ = 0. 


The left member of (2.11) is the invariant differential DX‘ of the vector X 4 
For any quantity, it is defined in a corresponding way, as is well known. The 
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coefficient of dz’ in the invariant differential of a quantity  is-called the covariant 
derivative |; of the quantity. For instance, we have 


2.12) . ‘= xt, + X'TH, 


Tix; = Tew — Tuli; — Tali; ; 

where X{,) , Tix; are given by (2.6). (2.12) defines the covariant derivative 
also for sets of functions which behave like tensors under transformations of 
coordinates, but are not homogeneous of degree zero in z’. For a scalar, the 
differentiations ---,; and -- - |; are identieal. 


From the definition of the covariant derivative and from shoes %; @ 10)—(2.12), 
it follows. immediately that , 
+ ~- f(a) Fi; = 0, (b) - gutz = 9, 
(2.13) ; ne , ; 
(ec) Y,;=0, la; =0. 
Besides the covariant derivative we use in the following the derivative 


G14 ste : O44, = FS. 


For a Simtiann ® eeeieiee of degree zero in x’, we have 

(2.15) — Pj \o = 0. 

Both differentiations ---|; and ---j||; generate from a given quantity a new 
quantity of the same kind having a subscript more. 


Finally, we state the connection between Tj and G*. For that purpose we 
consider the symmetric tensor of the third order 


(2.16) An; = 291; = 4F ms ae 
We have then 

(2.17) Aj = —49113, 

and with regard to the homogeneity of F 

(2.18) Ain = Aw; = An; = 0. 


If we write 


ac" yk 
(2.19) Pe Gt. 


the connection sought is given by 


(2.20) ry = Gi, - aa? 





* See E. Cartan, [6], VIII. 
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3. Permutation formulas. The differentiations ---,;, and ---|\; satisfy the 
permutation formulas 





(3.1) Byun — Paw = —PiN Rox , 
where , ? 

» ag Gg , 0G’ aG* 
(3.2) Rox = F (oa  Ox"* axt — Gis sai ax’k + Gis s). 
and 
(3.3) wile — Bila = —Pil(Akelo + Vie 


Rj. is a tensor which results from Cartan’s curvature tensor Rj. by inner multi- 
plication with [*."° (3.3) is obtained by means of (2.7) and (2.20). 


4. Two-dimensional Finsler spaces. Normal vector and main scalar. In a 
two-dimensional Finsler space, besides the unit vector (l), the normal vector, 
i.e., the unit vector orthogonal to (1), is of importance. If h; are its covariant, 


h' its contravariant components, we have 
4.1) (a) gin = Lilk + hihk , 
“" (b) of = Ltt + hyn*, (6k = 1 fori = k, = Ofori ¥ k). 


The connection between the vectors (1) and (h) is given with the aid of the 
e-tensor the components of which are 
eas 0, a2 = V9; é1 = —V9; 3 = 0, 
(4.2) 11 12 1 21 1 22 
’ e =0, e =—, € =--—, € =0. 
V9 v9 


Here and later on, ~/ indicates the positive root. It is seen that 


(4.3) €x + «a: = 0, ee ix = FY , een, = 2. 

Now we obtain, if the orientation of the normal vector is suitably chosen 
\ li = eh, l; = exh, 

(4.4) (b) hi = —e*,, h, = —exl", 
le hi —Uhi =e", lik — lah = ex. 


Using (2.13, b, ec), we have ex); = ei = 0 and therefore 
(4.5) ii = 0, hil; = (). 


The normal vector can be used for expressing the tensor Aix; by a scalar. 
With regard to (2.18) we can put 


(4.6) Aix; = 3G ix lj = Ihihyh; . 





‘°K. Cartan, [6], formulas (XIX), (XX), p. 36. 
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Hence we have in particular 
(4.7) (V9) ii = IV gq hy. 


° . ° ° . il 
I is called the main scalar of the two-dimensional Finsler space. 


5. The derivative >,. By means of the normal vector we define the deriva- 
tive 
(5.1) &, = Sah’. 
Then we have conversely 
(5.2) Bi = Bl; + Boh; . 
For a function ® positively homogeneous of degree zero in x’, the derivative 
, corresponds to the difference of its values in the line-element (xz, x’) and in 
the line-element obtained by parallel displacement in transversal direction. 
Since from (2.13¢), (4.5) it follows that 


1 aG’ 


a ri 
“Bap? ha = OAT 


(5.3) lin = 


and 
(5.4) lin = Lay = LG4;, hun = haw = hTG , 
we get 
G' ae 1 ag’ *i 
5.5 Eu — >=h} = —_—_ hh >= —T,, Ah’ 
(5.5) 2a? L=h Pagh hp Teak, 


and 


an, a SO 


PF ar’i rT) (lids _ Gislyh’, 


(5.6) 


it 1 aG” Ba *r 8 
hi). —_ P agi (hi)s — h,h : 


The first equation (5.5) is taken from (2.3). 


6. Landsberg’s angle and the derivative ;. In a two-dimensional Finsler 
space, we introduce for functions &, which are positively homogeneous of degree 
zero in x’, in addition to the derivatives , , &, the derivative 5 with respect to 
Landsberg’s angle. Landsberg” defines the angle at the point (x) by the 
integral 


lia Wh 
(6.1) = [x = | nat! 


ZL. Berwald, [1], p. 204. There ¥ = 4J is called the main scalar. 
12G. Landsberg, [12]. 
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up to an arbitrary additive function of position. The angle between two line- 
elements (x, ’) and (a, 2’) at the point (x) is defined by the definite integral 
(6.1), taken from #”:2" to 27:2". 

Because of (6.1) we have #||; = A; and therefore, for a function ® positively 


homogeneous of degree zero in 2’ 
(6.2) @iji = Bhi, Ba = Bish 
From (4.7) and (4.2) it follows in particular that 


(gixdo = 2Thihy , 
(6.3) 


0 


(J/ps=IV9g, Vo= “al :, 


where g(x) is an arbitrary function of position, and 


(6.4) (exo = lex, (e)y = —Ie". 
Further we get from (1.8), (1.9), (4.4) 

ls = h', hy = —l' — Ih’, 
(6.5) 


(lids 5 h , (his = —l; + Th;. 


7. Curvature of a two-dimensional Finsler space. Cartan’s permutation 
formulas. On account of (2.13b), the parallel displacement (2.11) is euclidean 
(or metrical). Therefore the curvature tensor Rj jin = g,;lhix is skew-symmetric 
in the two first indices also.” Hence we have Rjixl, = 0 and 


(7.1) Roux = Kh ex . 
The scalar 
(7.2) K = 4Roishre™ 


is called the curvature of the Finsler space.” 

From the permutation formulas (3.1), (3.2) we can now derive Cartan’s permu- 
lation formulas,” which are fundamental for the two-dimensional Finsler space. 
Above all we have from (4:5) and (4.6) 


(7.3) Aixlo = Ishih’hy ; 


Now let ® be a function positively homogeneous of degree zero in 2’. From 
(3.1), (3.3) and (7.1), (7.3) (6.2) it follows that 


(7.4) Dinu — Paw = —PoeKex, 
(7.5) Poy \e — Piles) = —Pe(T hihi, + hs ° 








'S E. Cartan, [6], p. 36. 
‘' K was introduced by A. L. Underhill, [16]. 
'S E. Cartan, [5], p. 121; see also P. Funk, [11]. 
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By going back to the definitions of ®,, &, 9, we obtain with regard to (5.5) 
and (6.5) Cartan’s permutation formulas 





Sin Dyy, = —®, — Ie, al 1,85, | 


7.6) Be, = cc = — ®, 
| Po — Poo = — Ky. 





From (7.6) and from Jacobi’s identity 
(7.7) {(®ss0 — Pov) — (Lowe — Povs)} + cycl. = 0, 


where + cyel. stands for the terms which arise by cyclical permutation of s, b, 8 
from the term written, we obtain, by cancelling a factor 3, the “Bianchi” 
identity” 





(7.8) Iu + IK + Ko = 0. 





8. The tensor /'G},,. Affinely connected Finsler spaces. In the following 
we put, for the sake of simplicity, 
_ Gi FF = 
dx? dar"* Ax'* posi dx’? Ax’* Ax!" Ax" 





(8.1) 


i 
jkrt 


In consequence of the homogeneity of G' we have Giur = Gior = Ghio 
Therefore if we decompose the tensor FG},, in components according to the 
vectors (I) and (h), there will appear only terms with I'h;h,h, and h'hjheh; . 

By carrying through this decomposition, we find first from (2.19), (2.20), 
(7.3), (6.2) 


(8.2) FGine = Vje tir + (Leh‘hjha)ohe « 

With regard to (6.5) we obtain 

(8.3)  (L.h'hihso = (Io + Ls)h‘hjhy — Ie(Uhjhy + hil je + A'hjle). 

On the other hand, we have from (2.8), (2.17), (4.6) 

(8.4) Dye le = OV jnlle — Qh'hehwD je 

From (2.8), (7.5) and (4.6), (5.4) we get for the first term on the right of (8.4) 

(8.5) g' Tyrie = (Uo — IT,)h‘hj. — To(Uhihe | | : 
— h'ljh, — h'hjlh.) he + 2h hha jr’. 


Hence we obtain finally 


(8.6) | FGie = [—201' + (eo + Li)h'Thjheh, «| 











‘6 It appears first, expressed otherwise, in L. Berwald, [1], formula (78), p. 206. 
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In particular we have 
(8.7) FGjxr = (Iso + Iv)hjhe . 
By operating on 0) WOM 0 = it follows with regard to (1.9), (6.5) 
that 
F’Gheme = (Iso + Io)o + 20(Ls0 + Iv)]hjhihm 

— (Iso + Io) (Ljhihm + hglihm + hghalm). 
A Finsler space is said to be affinely connected, if G’ are quadratic polynomials 


inz’. From (8.6) it is seen that the two-dimensional affinely connected Finsler 
spaces are characterized by 


(8.8) 


(8.9) I,= Il, =0 


According to the third permutation formula (7.6) we have for these spaces 
either K = 0 or J = const. From (8.2), (7.2) we see that, for an affinely 
connected Finsler space with K = 0, the Gj, , which are functions of position 
only, can all be transformed together to zero by a transformation of coordinates. 
Then it follows from (2.7) that F does not depend on x. The converse is evi- 
dent. Hence the affinely connected Finsler spaces with K = 0 are identical 
with the Minkowski spaces. The two-dimensional affinely connected Finsler 
spaces with J = const. were determined by the author. In §14 we come back 
to these spaces.” 


II. NECESSARY AND SUFFICIENT CONDITIONS FOR TWO-DIMENSIONAL FINSLER 
Spaces WitTH REcTILINEAR EXTREMALS 


9. Differential equations for the vectors (/) and (h) in a two-dimensional 
Finsler space with rectilinear extremals. The definition of a Finsler space with 
rectilinear extremals was given in the introduction. In the coordinate system 
mentioned there, the differential equations of the extremals are 

tke ak nt ym dx” wm >) 
(9.1) ax! x za!" = 0, (: = pal = ee) 
For a two-dimensional Finsler space there exists but one such equation. In 
this section, we seek conditions, invariant under transformation of coordinates, 
which are necessary and sufficient in order that a two-dimensional Finsler space 
may have rectilinear extremals. 

First we suppose that the considered Finsler space has rectilinear extremals. 
Let x" be coordinates for which the extremals are given by (9.1). It is evident 
that (9.1) is equivalent to 


(9.2) G = F'ql', 





‘7 Cf. L. Berwald, [1], pp. 207 f., 212 f., 215 ff.; E. Cartan, [5], p. 134 f. 
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where q is a function of z, x’, positively homogeneous of degree zero in x’. The 
The basis of our considerations is the system of equations (5.5), (6.5). On 
account of (9.2), it can be written as 


(9.3) li = -2ql', Is = —qol'— gh’, Is =h'; 

7 hi = —qol' — gh’, hs = —(9 + Igo + ool’ — (2q0 — I.)h’, 
ai a 

In order to derive (9.3), (9.4), we can proceed from the first and the last equa- 
tion (9.3). Then the second permutation formula (7.6) gives the second equa- 
tion (9.3), which is identical with the first equation (9.4). By the same pro- 
cedure we obtain the second equation (9.4) from the first and the third. 

Let us now consider the conditions of integrability of (9.3), (9.4). Two of 
them are identically satisfied, in consequence of the derivation of (9.3), (9.4) 
just given. The condition of integrability, which follows from the permutation 
formula for lis — 15, , is also identically satisfied. Each of the other permuta- 
tion formulas gives two conditions of integrability, since the coefficients of I' 
and h' must be respectively equal in both members. Thus we obtain six condi- 
tions of integrability: 


(9.5) qos = 2qo + Qq0, 
(9.6) ¢ +4 = —K, 
(9.7) goo = (¢ + Igo + qoo)s — 4(q + Igo + qos) + Igo — K — Q5, 
(9.8) qqs — @ + 2q0, = In. + KI = —K,5, 
(9.9)  (¢ + Lge + qoo)o + 21(q + Iqs + qov) + 21, = 0, 
(9.10) 3(q + Iqs + qos) = Ins + In. 
In (9.8) we made use of the identity (7.8). 
10. Reduction of the integrability conditions. In the first place, we show 


that the conditions of integrability (9.5)-(9.8) can be substituted by the two 
equations 


(10.1) K = -¢ -«@, 
(10.2) Ks = —3(q + qs), 


the second of which is found by substituting the value of qos from (9.5) in (9.8). 

Indeed, (9.5), (9.7), (9.8) follow from (10.1), (10.2). For, by substituting 
the value of Ky from (10.1) in (10.2) and making use of the second formula (7.6), 
we obtain (9.5), and by substituting the value of 3(q + qqgs) from (9.5) in 
(10.2), we get (9.8). In order to derive (9.7) we differentiate (9.5) with respect 
to 3 and apply the second permutation formula (7.6) for @ = qo. Thus we find 


Joss = 2Qv8 — Gov + q + qqve = 2(qos — gov) + Goo + q5 + qq . 





Now 
(qq + 
and ( 

11. 
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Now apply the first permutation formula (7.6) to qs — qs» and form 
(q + Iqe + qoo).. Then equation (9.7) results easily, in consequence of (9.5) 


and (10.1). 


11. Necessary and sufficient conditions for rectilinear extremals. From $10 
we see that the conditions of integrability (9.5)—(9.10) of the system (9.3), (9.4) 
of partial differential equations may be reduced to (10.1), (10.2), (9.10), (9.9). 
The first three of these equations can be written as 


(11.1) qs = i = q, 
(11.2) gq = —3Kos — 9%, 
(11.3) gos = 3165 + In) — Iqs — @. 


The equations (9.10), (9.9) are equivalent to (11.3) and 
(11.4) (Isa + Tb) + 27 (160 + Iv) + 62, = 0. 


(11.1)-(11.3) are a system of partial differential equations for the function q. 
In consequence of the permutation formulas (7.6), the conditions of integrability 
of this system are obtained by calculating the expressions 


A= dsb — Qbs + Kqs ’ 
B 


Qisd8 — Qids — Qed, 


C = deve — Goon + dos + Iga + 1sQ00 


with the aid of (11.1)-(11.3) and by equating the results to zero. The calcu- 

lation of A, B, C is simplified by the remark that q», is given by (9.5), go» by 

(9.7), if @ , ¢g + Igo + goo are respectively replaced by their values (11.2), (9.10). 
A simple calculation shows that A = 0 reduces to 


(11.5) Ko, — 3K, = 0. 
For B, we find first 
B = —2[Kos + (Iso + In)s + [Ko]. 
Since the permutation formulas (7.6) give 
Is0s + Tvs = Isso — Ip + Toe = Iso + KIo , 
we have further 
B= —32(Ko + IK + Ins)o. 


Because of the Bianchi identity (7.8), B is therefore identically zero. For the 
calculation of C, the second permutation formula (7.6) gives 


(Isa + Ib)sa — (ea + Ie)o = (Lea + Io) - 








ot sale 
2 
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By means of this formula, we obtain 
C = 4[(s0 + Lo)o + 22 (Lea + Ir)]s 
— 4q{((Iso + Ivo + 20(I00 + Iv) + 61.] — 2Ko — 21K, 
or, with regard to (7.8) 
C = (Lo + Io)o + 2(Ie0 + Ie) + 61ele 
— 4q[(Io + I)o + 21(Le9 + Iv) + 61,). 


Consequently C is zero, if (11.4) holds. Hence we have: 

If (11.4), (11.5) are satisfied, the system (11.1)—(11.3) 7s completely integrable. 

It is evident that the equations (11.4), (11.5) are necessary in order that a 
two-dimensional Finsler space may have rectilinear extremals. They are also 
sufficient. For let (11.4), (11.5) hold. Then the system (11.1)—(11.3) is com- 
pletely integrable. For a solution q of this system, all conditions of integra- 
bility (9.5)—-(9.10) of the system (9.3), (9.4) are satisfied. But the first equa- 
tion (9.3) or 


(11.6) ai, + 2qri = 0 


states that the considered two-dimensional Finsler space has rectilinear ex- 
tremals. Therefore we have: 

A necessary and sufficient condition in order that the extremals of a two-dimen- 
sional Finsler space be rectilinear is that the equations 





(11.4) (Isa + In)o + 20(Ie0 + Iv) + 61, = 0| 
and 








(11.5) | Ko, — 3K, = 0| 


be satisfied.” 





12. Geometrical meaning of (11.4). In the foregoing paragraphs, we es- 
tablished the conditions (11.4), (11.5) in a purely analytical way which does 
not show the independent geometrical meaning of each of these conditions. We 
give therefore still another deduction, which does not possess this disadvantage. 

For that purpose we insert some general considerations, restricting ourselves 
to the case of two dimensions, for the sake of brevity. Let a two-dimensional 
manifold with coordinates x‘ be given and a differential equation 


x! (a! ry 2G (zx, x')) — a!*(a!” + 2G" (a, z’)) = 0, 
(12.1) 





18 Cf. P. Funk, [11]. 
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where the functions G"(a, x’) are analytic and positively homogeneous of the 
second degree in x’. (12.1) defines a system of general paths x‘ = 2'(t) in the 
manifold. This system remains unaltered under the transformation 


(12.2) G(x, x’) = G'(a, 2’) + plz, x')x" 


where p is analytic and positively homogeneous of the first degree in 2’. * We 
call (12. wi a _ projective change of the functions G'. A transformation of coordi- 
nates t = Z'(x) transforms 24" in 


* i Fe i ’ *h 
dx! dx* Axi 
The second and the higher derivatives 
12 4) Gg _ ps aG' ths ag 4, ’ 
(12. ariax’e — 7k axiar*ar’e — Tike wietegeag= = iam 


therefore transform respectively like the coefficients of an affine connection 
(depending upon the line-element), and like tensors. To the projective change 
(12.2) of G corresponds respectively the ata change 


i i 0 ‘ 
(a) G; ik = Gir T 5 a , Of + <P pas 8; i + sain x", 
i i a ma ON ap é 
(12.5) 4 Gre = Gre + ariax’* bs s+ snes ae 6; 
(b) 3° 3 
Psi P ri 
+ Ox'* Ax’ be Ox"! Ax’* ax’ 





of the affine connection G}, , and of the tensor Gis . 

It is evident that the extremals of a Finsler space form a system of paths, 
r being defined by (2.4) in this case. Now the meaning of the condition (11.4) 
can be expressed as follows: 

A necessary and sufficient condition in order that in a two-dimensional Finsler 
space the functions G', defined by (2.4), can be projectively changed in such a way 
that the transformed functions G' be quadratic polynomials in x’, is that equation 
(11.4) holds. 

Proor: In the first place, we consider a two-dimensional space with a system 
of paths (12.1). Let G’ be quadratic polynomials in x. Then the right member 
of (12.5b) is zero. Now we have from (12.2), in consequence of the homo- 
geneity of p 


(12.6) ~~ 5( 35 7 27). 


‘* We suppose here positive homogeneity with regard to the following applications to 
Finsler spaces. In general, mere homogeneity is supposed. 
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By substituting this value in (12.5b), we find as a necessary condition for 
G@ = Gij(x)r"x” 


(12.7) ike _ (GT 08 + Gresd} + Tre idk) — 1G? net” = 0. 


Conversely, if (12.7) holds, we put 


(12.8) p = 1 (20:.a)2" - 25), 
where Gj; = Gj, are arbitrary functions of x alone. Then from (12.7) and 
(12.5b) we have Gi, = 0. Therefore (12.7) is sufficient too for G = 
Mt i(ae)ara!?2 
Now consider in particular a two-dimensional Finsler space, and let G' be 
given by (2.4). Then, by multiplying (12.7) by —3F and taking account of 
(8.6)—-(8.8), (4.1b), we obtain (11.4) multiplied by U‘hjheh, . 
We can give the theorem just proved the form: 
A necessary and sufficient condition in order that the extremals of a two-dimen- 
sional Finsler space form a quasigeodesic system of curves ts that (11.4) be satisfied.” 
It is easy to prove this form of the theorem directly. We have to suppose 


that for the extremals x’'r’" — 2’*zx’’* is a homogeneous cubic polynomial in 
x’, or that 


(12.9) a"G* — 2"G' = 10}, (a)a"r"2". 


If we differentiate (12.9) with respect to x”, 2’?, x’, x”, we obtain an equation 


which reduces to (11.4), multiplied by Ga hjhyh,h, , as can be seen from §8 
and §4. 


13. Geometrical meaning of (11.5). Before considering (11.5), we observe 
that the system of functions 
7G" 7G 


13.1 il a, ie ll 
in * axlidxt — Ax! ax! Gir 


a” 


Ox’* 


i 0G 
+ Gir ax’’ 


where G' are the functions which enter in (12.1), behaves by a transformation 
of coordinates like a tensor, but is positively homogeneous of the first degree 
in x’. We call Kj, the fundamental curvature tensor of the connection Gi: . 
For a two-dimensional Finsler space, where G' are defined by (2.4), the funda- 
mental curvature tensor is connected with the curvature K by 

(13.2) Ki, = FKh'‘ej., 


as (3.2) and (7.1) show. 





20 See J. Douglas, [7], p. 157 f., for n = 2 dimensions. 


*1 Cf. P. Funk, [11]. For the name “‘quasigeodesic system of curves’’ see W. Blaschke 
and G. Bol, [4], p. 245. 
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Now the geometrical meaning of (11.5) is expressed by the theorem: 

In order that there may exist a projective change of the G’s defined by (2.4) which 
transforms the fundamental tensor of curvature of a two-dimensional Finsler space 
to zero, it is necessary and sufficient that equation (11.5) be satisfied. 

To prove this theorem, we first establish a necessary and sufficient condition 
that in a two-dimensional manifold, bearing a system (12.1) of paths, the 
fundamental curvature tensor may be transformed to zero, by a projective 
change of G'. Then we suppose that the considered manifold is a two-dimen- 
sional Finsler space, and that (12.1) is the system of its extremals (2.2), and we 
show that the necessary and sufficient condition we found reduces to (11.5) if 
we introduce the curvature K and its derivatives with respect to s, b, 3. 

Let (12.1) be the system of paths of a two-dimensional manifold. We denote 
the covariant derivative with respect to the Gj, by a semicolon; for instance: 
_ OT; Sie oT; dG 


7 Tr 
— — T', jk- 


~ Oak Ox’? Ox'* 





(13.3) T isk 


Let Kj, be the fundamental curvature tensor of the connection Gj, obtained 
from the connection Gj; by the projective change (12.2). Then we have 


a i 1 a(p’)\ «i 
Ris = Ki — (pu — 5 °@ Jat 


(13.4) ; 2 
O(p ; OD;; OD: \ yi 
a ( ~ 3 ea ™ (328 ni Pe) j 





Hence the postulate Kj, = 0 gives 
SF ius 1 a(p’)\ 1 a(p’)\ . Op:; — ODe\ 4 
(13.5) j= (»; - 2 ari )a Keg (ps = 2 Ox'* 6; + dale _ az’ z. 


Suppose now that (13.5) is satisfied. If we put 





OK, _ 


(13.6) Kj == K;, aa ea K;, , 





we obtain by differentiation and contraction the system of differential equations 
(13.7) D:; — 1 a(p’) = 2K; + 4Kjr2x”. 


for the function p. The condition of integrability of (13.7) is 


caidas aie) 1 (32?) 
(13.8) Piisk D3k33 2 ( ox'i “ + 2 ax’* <j 


= 2(Kjx — Kus) + (Kirt) x — (Ker2”);J- 
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In consequence of the permutation formulas 


i Oo 


Ob\ OP 0 
( dx’ sk Ox"? : 


and of the homogeneity of p, we find for the left member of (13.8) the value 


dp ses ( _10p)\,: _ 1a(p’) j — (224 — 2 i 
am in— (pu 5 en) + (Po — 3 Gen) — (30H Gor) 2"f 


which is zero, because of (13.5). Hence the condition of integrability of (13.7) 
becomes finally 


(13.10) 3(K ix — Kui) + 3(Kirt”) se — (Kurt’”);;] = 0. 


Conversely, let (13.10) be satisfied. Then the condition of integrability of 
(13.7) leads back to (13.5). Hence (13.10) is a necessary and sufficient condi- 
tion in order that the fundamental curvature tensor of a two-dimensional 
manifold with a system (12.1) of paths can be transformed to zero by a projec- 
tive change of G’.” 

If the curves (12.1) are the extremals of a two-dimensional Finsler space, we 
have with regard to (2.6), (13.3) and to the symmetry Gi, = Gi; 


(13.11) T i — Teg = Tiay — Trp. 
Further it follows from (13.2), (4.4), (6.2), (6.5), (4.1) that 
(13.12) K;=FKl;, Kj = Kory +Kelh;, Ky!” = F(Kl; + Koh). 


85:4 — P35 = — 
(13.9) 


If we introduce, by means of (5.2), the derivatives with respect to s and b 
instead of the derivative ----«, we obtain in consequence of (2.7), (5.4), 
(4.4) and of the symmetry Gj, = Gj; 


(13.13) aa — Ky = FRG — hj) = FReex, 


(Krj2")y — (Kner) = F(Ky — Kos) ex - 
By substituting these values in (13.10) and by dropping the factor —3Fe;jx , 
we get (11.5). 


From the theorems we proved in §§12, 13, we can also easily see that the 
conditions (11.4) and (11.5) are necessary and sufficient in order that the 





2 In consequence of these considerations and the corresponding ones in §12, we have 
incidentally the theorem: 

The conditions (12.7), (13.10) are necessary and sufficient in order that a given system of 
paths (12.1) of a two-dimensional manifold be equivalent by point transformation to the straight 
lines of a flat projective space. 

Cf. also the end of §13. For n > 2 dimensions, J. Douglas, [7], p. 162 established the 
corresponding theorem, which is not however applicable when n = 2. 
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extremals of a two-dimensional Finsler space be rectilinear. For let the ex- 
tremals be rectilinear. Then, in a suitable coordinate system, (9.2) holds. 
Hence we obtain from (12.2) G* = 0, by putting p = —qF. Therefore (12.7) 
and (13.10) or, what amounts to the same thing, (11.4) and (11.5) are satisfied. 

Conversely, if (11.4) and (11.5) are satisfied, the coefficients G}, of the pro- 
jectively transformed affine connection (2.19) are functions of position only. 
dK}, 
ox" © 
(11.5) holds, this tensor is zero. Therefore the Gj, can altogether be trans- 
formed to zero by a suitable transformation of coordinates z* = 2‘(%). Then 


Since 





is easily seen that the curvature tensor of the connection Gi, is 
t j 


x. 
(13.14) jz = 0, = 62 tigi = (0, 


Hence, in the new coordinate system, the differential equation of the extremals 
has the form 
pi pp # ke p pi 
(13.15) zg" — z*7"" = 0, 
that is, the extremals are rectilinear. 


III. THe Two-pIMENSIONAL FINSLER SPACES WITH RECTILINEAR EXTREMALS 
THE MAIN SCALAR OF WHICH Is A FUNCTION OF PosITION ONLY 


14. The two-dimensional Finsler spaces with constant main scalar. We shall 
now determine all two-dimensional Finsler spaces with rectilinear extremals 
the main scalar of which is a function of position only. In order to avoid 
interruptions later on, we first reproduce briefly the manner in which all Finsler 
spaces with J = const. can be determined,” and we add a simple remark on 
these spaces. 


The Finsler spaces with J = const. belong to the class of Finsler spaces for 
2 


ue ; ed ‘ : ; 
which —- is a quadratic polynomial in zx’. In fact, if we differentiate 3 


three times with respect to x’ and if we use (1.9), (2.14), (6.2)-6.5), we get 
successively 


(14.1) 2 Fie a's ed, 
dx" /g V9 ( a 
a” F’ 
14, i; Pe 2 pe ie 
“eS Perm (= )= 2 Te [Ble — L(lsle + hls) + (2 — To) hihal, 
P Fg F’ 1 
14. SSE ae one = — one : 
us dx" da"* Aa’ (=) Fg (Too + IIo) hsheh 


The two-dimensional Finsler spaces, for which 


F i 
(14.4) 7 = oe (x)a"x™, (Gin = Guid, 





* L. Berwald, [1], p. 215 ff. 
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are therefore characterized by I55 + JJ» = 0. With regard to (14.2) and 
(4.1a), we have from (14.4) 


(14.5) eat) = <n (gu — 31 (leh + leh) — Fiohihe}, 
Vv 


and from (14.5) 
(14.6) *— 4y, 


where g = det. (gi). 
Suppose now in particular J = const. Then we have from (6.3) 


(14.7) vg = 


since we can choose in (6.3) g(x) = 1 without loss of generality for the following 
considerations. Therefore it follows from (14.4) that 


(14.8) F = Vour"r" e”. 


Now it remains to determine # as function of xz’. For that purpose we dis- 
tinguish the three cases 
(14.9) g=1-H'S0, 


and understand by a:v"’, B;"’ two linearly independent Pfaffians so that A = 
a8. — a8, > 0. We put in the case 
(a) I’ < 4: 


(14.10a) gnc a" = A [(aix”)? + (B:2")’); 
(b) f° = 4: 
(14.10b) aoc (aa"’)? ; 


(c) I’ > 4: 
(14.10¢) Gur ae” = a (a;x") (B,x”). 


Because of (6.1) and (14.1) we have 


(14.11) i = fox = #"da" 
Gin (x) x" x im = 





By integrating, we get respectively in the cases (a), (b), (e): 


(14.12a) o= va arctg — sf ib V(x), 


(14.12b) v= ae, 7 + ¥(2), 
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. iy 1 Bx" 
(14.12¢) o= i764 log a + (2), 


where y(x) is an arbitrary function of position. By substituting these values 
in (14.8), we obtain finally, by means of (14.9)-(14.12), the following types of 
two-dimensional Finsler spaces with I = const. 





. ny2 pi) 23 a; eretg fee 
(14.18a) I <4 : F = ((a2") + (B;2"))%e at 

2 7) ; ale 
(14.13b) I == 4 “4 F = (a;x Je az : 

‘ .4#(1- = mie: Se 
(14.13¢) r > 4 = F = (a;x"") ( 7A), 2') ( =a) 


In consequence of (14.10) and (14.12) a multiplicative positive function of 
position would enter in (14.13). We have taken it in in a;, 6; . 

For the Finsler spaces with J = const. we have the following theorem: 

If the curvature of a two-dimensional Finsler space with consiant I ¥ 0 is a 
function of position only, the space is a Minkowski space. 

Indeed, because of J = const. the Finsler space is affinely connected (§8). 
With regard to J = const., J ¥ 0, Ky = 0, the Bianchi identity (7.8) gives 
K = 0. Hence the space is a Minkowski space (§8). 


15. Theorems on two-dimensional Finsler space with rectilinear extremals. 
In the first place, we establish some theorems which hold for Finsler spaces 
with rectilinear extremals. 

THEOREM 1. When for a two-dimensional Finsler space with a quasigeodesic 
system of extremals the main scalar is a function of position only, it 1s constant. 

Proor: From the hypothesis 
(15.1) Is = 0 
it follows, with the aid of the first two permutation formulas (7.6), that 
(15.2) Iss = I, Tos = —TI, = Il. 

By substituting these values in (11.4), we have 
(15.3) IT, + 21, = 0. 


If we differentiate (15.3) with respect to 3 and take note of (15.1), (15.2), we 
obtain 


(15.4) (d’ — 2), + I, = 0. 


The determinant of the equations (15.3), (15.4) for I, , J, is 4 — J’. Therefore 
we have either J’ = 4 or J, = I, = 0, that is, in each case, J = const. 

TazorEM 2. A two-dimensional Finsler space with rectilinear extremals for 
which I” is a function of position only and # $, has constant curvature. 
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Proor: With regard to theorem 1 we have by hypothesis 
(15.5) (a) I = const., (b) Ke, = 3K. 
Because of (15.5a) the Bianchi identity (7.8) gives 
(15.6) Ks = —IK. 


In consequence of I = const., it follows from (15.6) that 
(15.7) (a) Kos = —IK, , (b) Ko = —IkK,. 
By substituting the value of Ky, from (15.5b) in (15.7a), we find 


(15.8) IK, + 3K, = 0. 


If we differentiate (15.8) with respect to 3, and make use of (15.5), (15.7) and 
of the first two permutation formulas (7.6), we obtain 


(15.9) 3K, + 21K, = 0. 


The determinant of the equations (15.8), (15.9) for K,, K, is 27° — 9. Ae- 
cordingly we get from (15.8), (15.9) K, = K. = 0, when I’ # 3. Finally, the 
last permutation formula (7.6) gives KKs = 0. Hence K = const. 

As a corollary of theorem 2 and the theorem of §14, we have 

THrorEeM 3. A two-dimensional Finsler space with rectilinear extremals for 
which I’ is a function of position only and 40, & is a Minkowski space. 

For such a space is K = 0. 


16. The exceptional case J” = 8. We shall now determine the two-dimen- 
sional Finsler spaces with rectilinear extremals for which J’ = 8. Let us begin 
with some general remarks. 

We write in the following 
(16.1) a=2, a¢=y, 
and we denote partial differentiation by subscripts. It is known that the 
differential equation of the extremals of a two-dimensional Finsler space can be 
written as follows 
(16.2) Fey — Py + Fi(a2’y” — yx”) = 0, 
where 
Pyy _ Py 


16.3) Fae Oe os OY og CED 
( ( ? ax’y’ (a’ 2 
From (16.3) we have 

(16.4) Fr. —¥ =6 


as a necessary and sufficient condition for rectilinear extremals. By differenti- 
ating (16.4) with respect to x’ or y’, we obtain, in consequence of (16.3), the 





Sin 
equ 


(16. 
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necessary condition 


- oF, “5 > 

Now let us consider a two-dimensional Finsler space with /* = $. With 
regard to (14.13c) and (16.1), we have for such a space 

(ax’ + By’)’ 

> SO 9 6 — . 
(16.6) va’ + by’ (ab — By ~ 0) 
We multiply the numerator and the denominator by (a6 — By)! and put 

i Pie i in on 

7 (a5 — By) (ad — By) 

(16.7) 
Y _ 6 


> <nneiflieataniaes Dp = , 
(a5 — By)! (ad — By)! 
Then we obtain 
(Ax’ + By’)’ 
Cx’ + Dy’ ’ 





(16.8) F = 


where 


(16.9) AD — BC = 


Without loss of generality, we can suppose D ~ 0. For, if D = 0, C ¥ 0, 
we arrive at D # 0 by permuting x and y. Then we have 





_1+8C 
(16.10) A= D 
and 
1\2 
(16.11) F = hee ee + 2Br’ + B’(Cx’ + by)}. 


We start with the form (16.11) of the fundamental function F and firstly make 
use of the necessary condition (16.5). For the function (16.11) we have 


2 


Consequently condition (16.5) runs as follows 


aC, »2 , (AC , aD 2 
16. pit 4\2 pried ip , -— 
(16.13) ap @) + (2 + PY ay +o =") 


(16.12) F, = 


Ox 


Since C, D are functions of position, (16.13) a into the system of differential 
equations 





(16.14) eae ae a =O 
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which gives, by integration, 

(16.15) C= -ay+e, D=ar+d (a, c, d constants). 
Now we distinguish two cases: 

(a) a=0. 

(16.15a) 

(b) a € 0. 

(16.15b) C = —a(y — yo), D = a(x — 4%). 

In case (a) we make the transformation of coordinates 

(16.16a) f=2, g =a (cx + dy) 

and put 


(16.17a) = Be, y) = 78(2,8 ~ 2) = 2(#, 9). 


By substituting in (16.11) and dropping the bars, we obtain 


)\2 ! 1\2 
(16.18) F = er + 22a’ + 2y' = eta ‘ 
| 











In case (b) the transformation of coordinates 


(16.16b) i g=a 7—®* 
— 40 

gives 
</ 


x 
age 


1 7 
C=- Cx’ + Dy’ = = 5. 


Further we put 


" 1 1 1 1 97 cal 
(16.17b) — B(a, y) = -2a(} + Xo, az + w) _ 2(Z, 9). 


By substituting these values in (16.11) and dropping the bars, we recover 
(16.18). 

The fundamental function (16.18) satisfies the necessary condition (16.5), 
but not yet the necessary and sufficient condition (16.4). This condition gives 
for z the differential equation 





(16.19) pon wae 
Ox ( 











(16.19) has the obvious solution z = const. which gives a Minkowski space. If 
z is not constant, integration of (16.19) leads to 


(16.20) lz + yz = W(z),| 


where W is an arbitrary function of z (which we suppose to be analytic). 











trans} 
tion ¢ 
funct 
In 
thele: 


17. 
K of 
funct 


(17.1 


intro 


Furt! 
(17.4 
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Hence we have: 

The fundamental function of a two-dimensional Finsler space with rectilinear 
extremals and I? = % which is no Minkowski space, can be brought by a suitable 
transformation of coordinates, to the form (16.18), where z is a non constant solu- 
tion of (16.19). These solutions are given by (16.20), where W(z) is an arbitrary 
function of z. 

In §17 we shall see that non constant solutions of (16.19) exist which never- 


theless give a Minkowski space. 


17. Space curvature in the exceptional case. We calculate now the curvature 
K of a Finsler space with rectilinear extremals and J° = 8, using (11.1). The 


function 


1 a 
(17.1) I= LG 
introduced by (9.2), is connected with F by 
‘ . Bian 
(1 ( 2) q= oF? art Mi 


as is seen from 
, dF _ oF x" hy aes 


Further we have from (9.2) 


(17.4) ae a 


des = all 


if we note the homogeneity of q in the 2’. 
Now let F be given by (16.18), (16.19). Then we substitute, with the aid of 








2 2 
(16.19), the partial derivatives = ; ae for the derivatives “: * =: 
2 2 2 
(17.5) 2 gS Si awe (). 
oy Ox OyOx Ox" Ox 


Moreover it follows from (16.20), by partial differentiation with respect to z, 
that 
(17.6) 08425 Me - yi OO 

dx W'(z) — y’ dx? (W’(z) — z)*’ 


where the primes indicate differentiation with respect to z. Thus we get 


ary és ts * Oz at F. SZ a (5 )(2) 
x’ + zy’) dx’ a x’ +2y') dz x’ + 2y') \dr}’ 


and by substituting these values in (11.1) 


17. __ v’® y’ )' 
om K (W’(z) — y)® (4) 
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From (17.8) we see that K is then and only then a function of position only, 
L— XH 
a te 
¥Y— Yo 
yo constants). With regard to the theorem of §14 and to the remark on the 
case 2 = const., made in §16, we have: 

When, and only when, the function z which enters into the fundamental function 
(16.18), (16.19) of a two-dimensional Finsler space with rectilinear extremals and 
I’ = &, has one of the values 


when ¥ is linear in z._ In this case we obtain from (16.20) z = — 


tl 
y — yo’ 
the space is a Minkowski space. In every other case the curvature of the Finsler 


space depends on the line-element. 
For z = 2 the transformation of coordinates 


(20, Yo, 20 constants) 


2= ih, t= 


(17.9) E=zr+my, gJ=y, 


for z = —”™° the transformation 
Y — Yo 
_ £- 2% in 1 
t= ’ = = 
Y — Yo ¥ — Yo 
carries over the fundamental function in 
(z’)” 


=f 


y 


(17.10) 


(17.11) PT « 


18. Table of all two-dimensional Finsler spaces with rectilinear extremals 
the main scalar of which is a function of position. Now we are able to set up 
a table of all Finsler spaces with rectilinear extremals the main scalar I of which 
is a function of position only, and therefore (§15) constant. For I’ ¥ 0, $ 
these spaces are Minkowski spaces (§15). For J = 0, we have K constant 
($15). Consequently the corresponding spaces are the Riemannian spaces of 
constant curvature. The case J° = 2 was discussed in §§16, 17. With regard 
to (14.13), we obtain the following table: 

(I.) K = const.. #0. I = 0. (Non-euclidean spaces). 


(1) K= 1 - Fe=k [(a’ $- y’)(a’)? was Qryx'y’ 4 (a° 4 z(yP 
k2 a? + a + y? 





2) Kat , pap l@ VOY + Qays'y’ + @ - AY? 


_ re a— — ¥? 





(k > 0; a = const. > 0). 


I.) K = 0. I = const. (Euclidean space, respectively Minkowski spaces 
with I = const. ¥ 0). 
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_ arct v 
(3) r < 4 : F a [(x’)* + (y’) eve &. 


, 
y 


‘ 


24 


4) [=4 : F=z'e 


(5) Fr >4 : F= even) yz). 


(III). K variable. P = 2. 


_ @! +2y')’ 
y’ 


(6) F , 2+yz =V(z), (W arbitrary function, ¥’(z) ¥ 0). 

This table shows that the two-dimensional Finsler geometries with rectilinear 
extremals and constant curvature K # 0 which are not Riemannian (Hilbert’s 
geometry and its generalization, Funk’s geometry of the specific metric and so 
on), have a main scalar which depends on the line-element (x, y; x’, y’). The 
same is true for the two-dimensional Finsler geometries with rectilinear extremals 
and K = 0 which are not Minkowski geometries.” 

Finally, let us remark that the spaces, enumerated in the above table under 
(I.) and (JJ.), admit a continuous three parameter group of transformations 
into themselves.” 


19. Landsberg spaces with rectilinear extremals. A Landsberg space is a 
two-dimensional Finsler space with 


(19.1) I, = 0.7 


The Landsberg spaces with rectilinear extremals are characterized by (19.1), 
(11.4), (11.5). In consequence of (19.1), equation (11.4) becomes 


(19.2) Tos = —2II, . 





y’ 
* First, we have, corresponding to J = +2, the two fundamental functions F = ae’, 
y’ 


F* = ze *’. The transformation of coérdinates = z, 9 = —y transforms F* to F, if we 
drop the bars. 

** P. Funk, [9], [10]. 

*6 For the spaces (II), (4), (5) ef. E. Nohel, [14], (especially groups 16, 20, 25), A. Mac- 
cone, [15], E. Cartan, [5], p. 135. In these papers, the space (II), (3) does not appear, 
because they take as starting point the complex domain, where the types (II), (3) and (II), 
(5) are not different. All spaces (II), (3)-(5) figure in S. Lie and F. Engel, [13], p. 435 ff.; 
but their ds is not there given. 

*7 These spaces were first considered, from another point of view, by G. Landsberg, 
[12], p. 334f. See also L. Berwald, [1], pp. 208 f., 211-213; E. Cartan, [5], p. 133 f. 
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From (19.1) and (19.2) we obtain, with the aid of the permutation formulas 
(7.6), the formulas 

In = —I], Ip, = —h, 

Ion = (-In + - DH, Io, = 31h, 
Ison = (—Ioo + 319 —P +4DI,, Toone = (400 — 71’ + DI, 
sven = (—Lov0 + 4110 + 32, — 61h + 816 +  -11P + Dh, 


(19.3) ¢ 





which we shall use later on. 

We desire to determine ail Landsberg spaces with rectilinear extremals. For 
that purpose we state first the following theorem: 

The Landsberg spaces, the extremals of which form a quasigeodesic system of 
curves (§12), are identical with the two-dimensional affinely connected Finsler 
spaces. (§8). 

Proor: A Landsberg space with a quasigeodesic system of extremals is 
characterized by (19.1), (19.2). In consequence of (19.1) the Bianchi identity 
(7.8) reduces to 


(19.4) Ke = —IK. 
The third permutation formula (7.6) gives 
(19.5) Ih, = KI5. 


Now apply the second formula (7.6) to@ = I,. On account of (19.2), (19.5), 
(19.4), it follows that 


(19.6) ly = Snip ~ Iie * Millan: 4 Slade. 
Further we get from the first permutation formula (7.6) 

Teo = Ivv9 + Ive + IT . | 
When we calculate both members by means of (19.2), (19.4), (19.6), we obtain 
(19.7) 215 + K(Iove + 31199 + 13 + 20°Ty + Is) = 0. 
Differentiating (19.7) as to # and using (19.2), (19.4), we find 
(19.8) K[Iss00 + GITo00 + (5I9 + 110? + 1)Ig9 + (719 + 62° + 3)II5] = 0. 


If K = 0, it follows from (19.7) that J, = 0. The space is a Minkowski space, 
because of J, = I, = K = 0 (§8). 

Now let K ¥ 0 and differentiate the second factor on the left of (19.8) with 
respect to b. With regard to (19.3), it follows that 


(19.9) In(Ioo0 + 31T9o + 5 + 2FIy + Ig) = 0. 


If I, = 0, the space is an affinely connected space with J = const., on account 
of (19.1) and K = 0 (§8). If 
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(19.10) To00 + 31Io8 + 5 + 21°Iy + Ip = 0, 


(19.7) gives again IJ, = 0. 

Conversely, it follows from (8.9) that for a two-dimensional affinely connected 
Finsler space (19.1), (19.2) are satisfied. 

Now it is easy to find all Landsberg spaces with rectilinear extremals. We 
must only take, from the table of §18, the spaces with K ~ 0 and add all 
Minkowski spaces. Hence we have the theorem: 

There exist but the following types of Landsberg spaces with rectilinear extremals: 
(I.) K = 0: Minkowski spaces. 


F = F(2',y’). 
(II.) K = const., # 0: Non-euclidean spaces. 








—1  . pL l@ + yey — 2xy2'y + @+27)y)1, 

() K=, : F=k eee : 
1 : [ ees *) x’)? + 2 1,,¢ +4 a ae ) "8 

@ K=-, : pr a=4 9 Fo “)y'P 


(k > 0, a = const., > 0). 
(III.) K variable. 


’ r\2 
F = eta c+ yz =y(z), (y arbitrary function, y’’(z) ¥ 0). 


PRAGUE, BOHEMIA. 
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The ten-parameter group of motions of De Sitter space is that of the real 
homogeneous linear transformations of w, 2, y, z, and t, that leave w* + 2? + 
y+ 2 — # invariant and that can be built up from infinitesimal transforma- 
tions. It contains the sub-group of the real homogeneous linear transformations 
of w, x, y, and z, that leave w? + 2? + y? + 2 invariant and that can be built 
up from infinitesimal transformations, the six-parameter group of rotations of 
four-dimensional Euclidean space. 

We seek differentiable representations by unitary matrices, in general infinite, 
of groups locally isomorphic with the ten-parameter group. Hermitian matrices 
are found for the infinitesimal transformations of representations which admit a 
transformation, perhaps to generalised matrices with a continuum of rows and 
columns, reducing the six-parameter sub-group to a product of its irreducible 
unitary representations. 


1. INTRODUCTION 


Wigner! has recently discussed the unitary representations of the inhomo- 
geneous Lorentz group. If this group is regarded as the group of motions of a 
flat three plus one dimensional Riemannian space, it may be regarded as the 
limit for zero curvature of the group of motions of De Sitter space. Dirac’ 
has considered the form that the electron wave equations take in De Sitter space. 
It is of some interest to discuss the unitary representations of the group of De 
Sitter space, which should be relevant to the quantum theory of the external 
properties of systems moving in such a space. These representations are in- 
vestigated here by the standard methods of matrix-mechanics,’ applied to the 
infinitesimal operators of the group, and it is believed that no differentiable 
representations admitting a transformation reducing the representation induced 
in the group of motions of a three-dimensional sub-space of constant positive 
curvature have been missed. The argument, however, lacks logical rigor in its 
application to the representations depending on continuous parameters. 


‘On Unitary Representations of the Inhomogeneous Lorentz Group. Annals of Mathe- 
matics 40 (1939) p. 145. 

?The Electron Wave Equation in De Sitter Space. Annals of Mathematics 36 (1935) 
p. 657. 

*See e.g. Born and Jordan. Elementare Quantenmechanik. Ch. III. Springer (1930). 
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2. THe INFINITESIMAL OPERATORS OF THE GROUP AND THEIR 
COMMUTATION RELATIONS 


te] fs) fe] 
a M = = g— 
(y dz :2), (2 


te) te) 
-(w2-23), Y 


rs] 0 
v= (12 t-— 


are a complete set of ten self-adjoint linear differential operators of the group, 
satisfying the 45 commutation relations:— 


MN —NM = .L, MZ —ZM =.iX, YN —NY =X, 
LX —XL =0, YZ —ZY 
NL —LN = .M, NX —XN =.Y, ZL —LZ =wWY, 
MY —YM=0, ZX — XZ 
WW, LY —YL = .%, XM—MX = J, 
NZ —ZN =0, XY 
UU VN —NV =W, LU — UL 
YW-—-WY=0, VZ 0, 
V, WL-LW =.V, MV—VM 
ZU —UZ =0, WX —- XW =0, 
WW, UM—MU=.W, NW-—WN =0, 
XV —VX =0, UY —YU =0, 
= 1, LT —TL =0, XT —TX =W, 
iT, MT—TM=0, YT —TY =VW, 
iT, NT —TN =0, ZT —TZ =W; 
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UT —TU =X, VW —- WV = -W, 
VT —TV tY, WU — UW = -.M, 2.23 
WT —TW = 12, UV —VU = —wW. 


The first six of these operators (2.11) satisfying the first fifteen commutation 
relations (2.21) are a complete set of operators of the six-parameter sub-group. 

To any differentiable representation of the ten-parameter group there must 
correspond a representation of these operators. If the representation of the 
group is unitary, that of these operators is Hermitian. A transformation re- 
ducing a representation of the sub-group to a sum of its irreducible unitary 
representations must reduce the representation of its operators to a sum of 
irreducible Hermitian representations. 

The commutation relations of the displacement operators for free motion ‘in 
quantum theory in De Sitter space can be reduced to the above form by change 


of phase. 


3. OUTLINE OF THE METHOD EMPLOYED 


Let the (known) representations of the operators, A = L, M, N, X, Y, and 
Z, of the sub-group be expressed in the form (in Dirac’s notation) ; 


(a | A(8) | @’) 3.1 


where B is a set of parameters taking a denumerable infinity of discrete values, 
one for each irreducible representation, and a, a’, are sets of parameters number- 
ing the columns and rows of the matrices, and taking for each 6 only a finite 
number of discrete values. Then a transformation reducing a representation 
of the sub-group to a sum of these representations reduces it tothe form == 


Syy'5ger (a | A(8) | @’) | 3.2 


where gg is the usual Kronecker 6; 7, y’, are sets of parameters numbering the 
columns and rows of the (generalised) matrices, taking for each 8, perhaps a 
continuous, perhaps a discrete set of values, perhaps no values at all,’ and 
5, 1s to be interpreted accordingly. 

If the remaining operators, P = U, V, W, and 7, of the group admit the 
transformation, they take the form 


(y, B; a| P| 7’, B’, a’). 3.3 

The 24 commutation relations of the P’s with the A’s (2.22) enable us to ex- 
press the P’s in the form 

(y | P(B, B’) | v’)(a | P(B, 6’) | a’) 3.4 


where (@ | P(8, B’) | a’) are determined, and indeed for any 8 fail to vanish only 
for at most four values of 6’. 

In virtue of the fifteen commutation relations of the A’s (2.21) and the 24 
commutation relations of the P’s with the A’s (2.22), the remaining six commuta- 
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tion relations of the P’s (2.23) follow from any one of them. This one then 
determines possible forms for (y | P(8, 6’) | vy’). 


4. THe REPRESENTATIONS OF THE SUB-GROUP 


The work is shortened if the sub-group is expressed as the direct product of 
two three-parameter groups. 


Let A*weLb +X, et = x, 
B,=M+Y, B=M-Y, 
C.=N+Z, C,=N -4Z, 


so that any one of A; , B,, and C; , commutes with any one of Az, Bz, andC;, 
and these are the operators of factor groups for either of which 


BC — CB = 2.4, 
CA — AC = 2B, 
AB — BA = 2. 
If A- B=C_, 
A+ B=(C,, 
we have CC_-—C.C = -—2C_, 
CC, —CxC = 2,, 4.3 
C_C, — C,C_ = —4C, 


and the only non-zero components of an irreducible (Hermitian) representation 
may be brought to the form 


(m|C_|m+ 1) = 2V/{Gj —- mG +m + 1)}, 
m= —j,-j+1,---,j— 1,3, i 
(m|Ci.|m — 1) = 2{G + mG — m + 1)}, 
(m|C| m) = 2m, 
where for different representations 
j = 0, 3, 1, %,---. 4.5 


Thus we may take for 8 above j; and je for these factor groups, taking any 
values from (4.5), and for a above m and mz satisfying (4.4), and the matrices 
representing L, M, N, X, Y, and Z, may be written down immediately, in 
particular 


(mi, m2 | Z(j1, j2) | mi, m2) = mi — me 4.6 


form; = —ji, —jit1,---,j1, me = —je, —jo +1, --- , je, the remaining 
components vanishing. 
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5. THe DETERMINATION OF (a | P(8, 8’) | a’) 
The 24 commutation relations (2.22) give for the operators of the first factor 


group 
(T+ W)C — C(T + W) = -(T+ W), 


(U-.W)C -— C(U- WW) =(U —- WW), ~ 
(T + W)C, — C.(T + W) =0, i 
(U — W)C_ — C_(U — WV) =0, 
(T+ W)C_ — C(T + W) = —2.U — <V), 5.13 


(U — W)C, — Cx(U — WV) = 2K(U — LV), 


and similar equations for T — «.W and U — .V with C, and C_ interchanged 
and the sign of C reversed; and for the second factor group similar equations 
with the sign of 7’ reversed. 

For (5.11) to hold it is necessary and sufficient that a component of the trans- 
formed representation of 7’ + «W vanish unless m’ = m — 3, one of U — iV 
unless m’ = m + 3, and then for (5.12) to hold that in these cases they involve 
m only in factors +/{(j + m)!(j’ — m + 3)!/G — m)!/(7’ + m — 4)!} and 
Vij — mj’ + m + 4)!1/G + m)\(7’ — m — 4)!} respectively. (5.13) 
gives then 

{(j-7) — {G+ 7 + 1? — 3} = 9, 
so that the components vanish unless 7’ = 7 — 3 or j’ = j + 3, and we must 
have for the only non-zero components, 


(A, jr, mi | T+ WI], jr — 3, mi — 3) = (A| w(jr) | VY) + mi), 

(A, jr, mi | T+ Wd, 51 + 3, m — 4) = (Al 0G) | XV — mi + 2D), 
(A, jr, mi | U — WV |X’, jr — 3, mi + 3) = —e(A| u(r) | YA (Gi1 — md), 
(A, j1,m|U— WV |, jr + 3, mi + 4) = A] off) | MV Gi + m + 2D), 


where A includes 7, j2 , and mz ; necessary and sufficient conditions that (5.11), 
(5.12), and (5.13), be satisfied, u(j,) and v(j;) being arbitrary matrix functions 
of j: for 71 = 3, 1, 3, --- , and for j; = 0, 4, 1, --- , respectively. 

Similarly, for their only non-zero components, 


(A, jr, mi | T — Wd, jr + 3, mi + 4) = (| wh) | AVG +m + DV, 
(A, jr, mi | T — Wd, jr — 3, mi + 3) = (| o'r) | VY) (j1 — ma), 

(A, jr, mi | U + WV |, jr + 4, mi — 4) = A] w'(jr) | MVC — mi + OD, 
(A, jr, mi |U+ WV |, 51 — 4, mi — 3) = — eA] o'r) | NY) (Gi + md), 


where u’(j;) and v’(j;) are further arbitrary matrix functions of j; for j1 = 0. 
> l.++., and for j, = 4, 1, $, --- , respectively. 
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Combining these results with the corresponding results for the second factor 
group, for which the components of 7’ + .W are connected with those of U + 
.V, and those of 7’ — .W with those of U — cV, we obtain the necessary and 
sufficient conditions that the 24 commutation relations (2.22) be satisfied with 
L, M, N, X, Y, and Z, given as in (4.). The only non-zero components of 
T+ W,U — WV,U + iW, and T — .W, have the form 
(y, jr, Mi, jo, me| T+ Wy’, ji — 3, mi — 3, Jj2 — 3, M2 + 9) 
= (y | fGr, 52) | vt + m1) (G2 — me) }, 
(y, ji, M1, j2,me|T + Wy’, ji + 3,m — 2y J2 — 3, me + 3) 
— = (vl 9G, de) | VAG — mi + 1)G2 — me)}, 
(y, jr, M1, j2, me! T + W |’, j1 — 4, mi — 4, jo + 4, me + 4) 
eeikle = (y | Aji, Je) | VV 1Gi + mi) G2 + me + 1)§, 
(y, ji, mi, je, me| T+ Wy’, ji + 3, mi — 43, Jeo + 3, M2 + 9) 
3 = (y | Aji, de) | eV 1G — mi + 1)(G2 + me + 1}. 
(¥, Ji, ™i, je, me|U — WV |’, ji —-3, m1 + 2, jz — 2, m2 + 2) 
= = uy [fGi, Je) | vA EG — mie — me)}, 
(y, ji, M1, jo, m2 | U—' iV |’, ji + 3, mi + 3, Jo — 3, m2 + 4) 
= 7 | 9Q1, 52) | Y)V tC + mi + IG — me)}, 
(y, jr, mi, j2,m2|U— V iy’, ji — 2, mi + 3, J2 + 2, me + 3) 
= — uy | ACh, J) | VV ts — ma)G2 + me + 1)4, 
(Y, Jr, Mi, je, me|U — WV |’, jr + 3, mi + 3, Je + 2, me + 3) 
| = uy | AC, de) | VA UG + mi + 1)G2 + me + 1}, 


) 





(y, jr, ™1,J2,me|U+ W | y’, ji — 4, m1 — 4, je — 4, me — 
= uy | f(ji, Je) | YA + mi) (Je + me) }, 
(y, Jr, Mi, jo, me|U+ WV | vy’, ji + 3, mi — 4, j2 — 4, me — 3) 
= uy |9(j1, J2) |v 1G — mi + 1I)G2 + me}}, 
(y. ji, M1, jo, m2|U+ W |’, ji: — 3, mi — 4, j2 + 4, me — 3 
= —y | ACh, je) | yA 1G + mi) (jo — me + 1)}, 
Me | U+ WV ly’, a1 + 4, m: — 2, je + 4, m — 4 
= —uy | k(ji, de) | YA E(G — mi + I)(Gj2 — me + I}, 


Nir 


(y, ja » im, Je 


(y,j1, M1, jo, me| T — Wy’, ji — 3, mi + 3, jo — 4, me — 4) 

= —(y |fGi, de) | v1 (G1 — mi) Ge + me)}, 
(y, ji, M1, Je, me| T — Wy’, ji + 3, mi + 3, je — 2, Mz — 3) 
= (y | (jr, 92) |v VEG + mi + 1)G2 + m)}, 
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(y, jr, ™1,J2, me | T — W|vy',j: — 4, mi + 3, j2 + 3, me — 4) 
= (y | h(ji, Je) |v’ EG — mi) (G2 — me + 1}, 
(y,j1, mr, j2, me| T — Wy’, jr + 3, mi + 3, jo + 3, me — FP 
= —(y | k(j1, de) | eV EG + m, + 1)(j2 — me + 1)}, 


where f(j1, J2), 9(j1, J2), AG, Jz), and k(j1, jo), are in general arbitrary matrix 


functions of j; and je, 7; taking values from 3, 1, $, --- , in f and h, from 0, 
1 1,---, ing and k, jz taking values from 3, 1, $, --- , in f and g, from 0, 3, 
1,---, inh and k, but not necessarily all such pairs of values. 


If the representation is Hermitian, f(j: , j2) and —k(j; — 3, j2 — $) must be 
Hermitian conjugates, and g(j1, j2) and h(j; + 3, je — 4) must be Hermitian 
conjugates. 5.3 

It should be noted that the factors involving m; and mz; make components 
connecting allowed values of m; and mz to those that are not allowed vanish, so 
that we need not usually take explicit consideration of the ranges of m, and m,. 


6. THe GENERAL Form or f, g, h, AND k 
The remaining six commutation relations now reduce to any one of them; 
in particular WT — TW = Z in the form 
(T+ W)(T — W) — (T — W)(T + W) = —2Z 
gives 
jf, J2)9G1 — 3, J2 — 3) = Gi + Vg, 52)fGi + 3, J2 — 4), 
Jf(hi, J)h(CGi — 2, J2 — 2) = Jo + YAGi, J2)f(Gi — 3,52 + 9), 
Jo (I1, Ja)k(Gs + 2) Je = 2) - (j2 + Ik(j1, J2)9 (1 + 3; Je + 2); 
Jha, jo)kG1 — 3, J2 + 2) = (hr + WK, j2)h(ji + 3, j2 + 9), 
forji = jij = Je — UA =A ld = fj =At 1d = jejandj = hj, 
jo = j2 + 1; respectively in 6’, B: and 
(mij2 — meajr)f(jr , jo) k(Gi — 3,92 — 4 
+ (mij2 + mo(ji + 1))g(ji, J2)h(G1 + 3, 52 — 3) 
— (mi(j2 + 1) + magi)h(ji, J2)9G1 — 2, Je + 4) 
— (mio + 1) — ma(Gr + IAG, 52)f(G1 + 3,52 + 4) = mi — ma, 
for ji: = ji, je = je: all the other components vanishing identically. 

These relations hold whenever both j:, jo, and j;, js, are possible values 
of 8, non-existent terms being interpreted as vanishing. 

Irreducible solutions of these equations are of two types; those for which the 
least value of either ji Or jo in B in any non-vanishing component of (3.3) differs 
from zero; and those in which both these values are zero. 

For the first type suppose, for example, j; < p > 0, and j; = p, for je = q. 

Then (6.12) gives two independent relations between non-vanishing com- 
ponents of g(p, g)h(p + 3, q — 3) and k(p, g)f(p + 3, q + 4), which are 
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therefore constant multiples of an idempotent matrix, and g(p, g) commutes 

with h(p + 3, g — 3), and k(p, q) with f(p + 3,q¢ + 2%). Equations (6.11) then 
show that g(p, g + Lh(p + 3,4 + 2), kp, a — DS, @ — 2), gp + 3, 
q + 3)h(p + 1, q), and k(p + 3, ¢ — 2)f(p + 1, q), are sums of multiples, 
zero for the first two, of the same idempotent matrix and of matrices whose 
product with it vanishes. Equations (6.12) then give similar information about 
g(p + 3,9 —d)h(p+1,¢—-1), 9p +1, Qh(p + 3,9 — 2), kD + 1, Dip +h 
q + 3), and k(p + 3,q¢ + 4)f(p + 1,q¢ + 1);andsoon. There are more than 
sufficient equations. If inconsistent they show that no such representation 
exists; if consistent that it decomposes into a term for which, for non-vanishing 
components, j; > p for j2 = q which can be treated in the same way, and a 
term for which each g(ji, j2)h(ji + 4, jo — $) and k(ji, j2)f(Gi + 4, Je + 3) 
is a determined multiple of the same idempotent matrix. 

This result and equations (6.11) now show that different transformations 
among y, 7’, for various j: , jz, can be taken so as to make all of f, g, h, and k, 
simultaneously diagonal, and the representation is reduced to a sum of repre- 
sentations in which f, g, h, and k, have but one row and column; if the original 
representation was Hermitian, the transformations required form a unitary 
transformation. 

For the second type suppose j; = 0 for je = gq. 

(6.12) now gives only one relation between g(0, g)h(3, q — 4) and 
k(0, a f(4, a + 4); if we make one of these diagonal, so must be the other, and 
the argument for the previous case, introducing at each stage different trans- 
formations among vy, 7’, for various j; , j2, shows that either no representation 
exists, or that it decomposes into a term for which always j; > 0, for jo = 4, 
and a sum of terms for each of which each g(j1, j2)h(j1 + 4, j2 — 4) and 
K(ji , Jo)f(j1 + 3, Je + 3) is a determined multiple of an idempotent matrix, the 
products of the several idempotent matrices vanishing; perhaps continuously 
many such terms corresponding to possibly continuously many characteristic 
values of g(0, g)h(3, q — 4). Finally as in the previous case, the representation 
reduces to a sum of representations in which f, g, h, and k, have but one row 
and column. It is this argument, involving matrices with continuously many 
rows and columns, which lacks logical rigor. 

Thus for irreducible representations, f(j1 , j2), g(j1 , Jz), h(ji , jz), and k(ji , Je), 
in (5.1) can be treated as numbers, f(j1, j2), g(j1, J2), A(ji, Je), and k(ji, je). 6.2 

Moreover the relations (6.11) and (6.12) involve either only values of 8 for 
which j; + je is an integer or only values for which it is half an odd integer, 
so that any irreducible representation involves either only integral values of 
ji + jo or only half odd integral values. 6.3 


7. THE SOLUTION OF THE RECURRENCE RELATIONS 


Equations (6.11), if g(j1 , j2)g(d1 — 3,32 — 3) and A(j1, jo — h(i + 3,42 — 2) 
do not both vanish, give 


Fi(je + 2)f(j1 , J2)k (G1 — 3, J2 — 4) 
= (fi + I) (Je — 3)fGi + 3, d2 — Kj, je — ‘D, 





so 
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so that in general 

Fj + jp) . _ 
Hj + 4) jad: + 4) 
Likewise, unless f(js, ja)f(r + 4; Je — 4) and k(jr — 4, ds — DRA, de — VD) 
both vanish for values of j; and js involved, 





fli, dks wer 3, de — 3) = — 


ween Mt = Tan eae 
oh WMH + 35 — ) = GHG a ED . 





Then (6.12) gives 
(myjo — mejr)F (jr + Jo) _ (majo + 1) — me(Jr + IFC + de + DY 




















“jlji + 3) 5292 + 9) (ji + S)(jr + 1U)(je + 3) (92 + 1) 
(mije + me(jr + 1))G( ji — je) (mi (je +1) —m(fjit WIG(AtzZ +1) 7.2 
~ (a FDA + DjeGe + D ja + DG + DR + YD 
= ™m — m 
Putting m. = m, in (7.2) we obtain 
iti) + Gat) , Fitin+0) +Gh-a-D 
delhi + 3) (je + 1)(ji + 3) 

= F(j. + jd) + Gj — pe — —1) 4 F(j + pt+tUD+G(a —- je) 
rr julie + 4) (ji + IG + 4) 


from which follows 
F(j, + jo) + Gi — je) = jogs + ¥)leGi + je) + Wi — je). 7.3 


Subtracting this equation from the corresponding equation with 7; and j2 re- 
placed by j1 + 4 and je + 3 we see that y is a quadratic function. Likewise ¢ 
is a quadratic function, so that F and G are of the fourth degree, and for (7.3) 
to hold, of the forms 


Pj + je) = — al(ja + je)® + (ja + je)? + O[(Gs + je)? + Gi + 52)] +, 
(ji — jo) = —al(jr — Je)? + (Gi — G2) PP — O[(G1 — je)? + (i — G2)] — 


Substituting these values, (7.2) is satisfied identically if a = 1/16, and we 
have 


Sli, RG — 4, je — 4) 


_(atk-DAtkh+P+tDAt+tTe-DAte+I+ 1) 7 AI 
Qjr(Qjr + 1)2ja(2j2 + 1) 





WI jd k( js + ;, je = 3) 


- (aA-k-D(pa-kt+optDAar-e-DA-kt+at ) 749 
(Qj + 1)(Qjfr + 2)Qjo(Bjo + 1) 


where p and q are arbitrary. 
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Starting with any given pair of values of j, and jz from 0, 2, , --- , for which 
these products do not vanish, we may use the recurrence relations to find the 
values of these products for adjoining pairs of such values for which the sum 
of j; and j» has the initial form (either an integer or half an odd integer) unless 
stopped by zero factors in the numerators, which must occur before reaching 
pairs of values outside the above range. In this case the separate factors of 
these products can always be chosen so that all the equations (6.11) are satisfied, 
and (5.1) yield matrices satisfying all the commutation relations. 7.5 

In the special cases in which either all the products f(ji , j2)k(j1 — 2, j2 — 3) 
or all the products g(ji , jz)h(ji1 + 43, J2 — %) Vanish, it can be verified that the 
surviving recurrence relations suffice to require the same form (7.41) or (7.42) 
for the surviving products. 

8. INVARIANTS 

For matrices of the above form 


TP+U+V2+W2- (2+ Me+NIF K+ ¥24+2) 81 


must exist, and, commuting with each matrix, be an invariant. 
Writing it in the form 


MT + W)(T — WW) + (7 — YT + WY) 
+HU+ MU- WV) +U-nU+ WI 
-HWL+XP+ (M+ VP + V+ D4 
— ML — XP + (M - YP + (WN - 24, 


we see that unless a’ = a and p’ = 8B, its components vanish, and that for 8 
and #’ given by j1, j2, the (diagonal) component has the form 


—Qijef(hr , Idk — 3, d2 — 3) + 2G + VG2gCh1 , Ja) + 2, J2 — 2 
+ 2jije + h(i, J2)9Gi — 3,52 + 3) 
— 2ji + 1)G2 + Dk, J2)f(Gi + 3,324 3) 8. 
— 2jiGji + 1) — 2j2(92 + 1) 
= §— (q+ 4)? — (+ 9), 


which must be its value for a representation determined by gq and p. 
Again 


(LX + MY + NZ)? — (LU + MV + NW)? — (LT + WY — UZ)? 
— (MT + UZ — WX)? — (NT + VX — UY)? 


must exist, and, commuting with each matrix, be an invariant. 
Writing this in the form 


K? + (KT — TK)? + (KU — UK)? + (KV — VK)? + (KW — WK)’, 


8.2 
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where K = LX + MY + NZ, has only diagonal components j:(j: + 1) — 
jojo + 1), so that we can at once write down the components of K(T + .W) — 
(7 + W)K, etc., we see that unless a = a’ and 8 = 6’, the components of (8.2) 
vanish, and that for 8 and #’ given by j:, jz, the diagonal component has the 


form 
(jx + 1) — Joe + 1))? 4+ 2jrjo(jr — J2)*F (91, J2)KCQ1 — 2,52 — 3 
— 2(j1 + WjeQhi + Je + 1)?Q(ji, J2)ACGi + 3, J2 — 4) 
— 2jije + WGi + J2 + 1)*ACHi, J2)9Gi — 3,42 +4) = 8.21 
+ 2(jr + 1)(G2 + 1)CGr — 52)*k(H1 , J2)fGi + 2, J2 + 4) 
= p(p + 1)qgq + 1), 


which must be its value for a representation determined by gq and p. 

These results enable us to show that representations for distinct pairs of 
values of g and p are distinct. (In the quantum theory of the free motion of a 
system in De Sitter space, (8.1) would be proportional to the square of the rest 
mass, (8.2) for given mass, to the negative square of the spin.) 


9. THE ALLOWED VALUES OF p AND ¢ 
In order that the matrices should be Hermitian, f(j: , j2) and —k(j: — 3, 
je — 3) must be complex conjugates and g(ji, je) and k(ji: + 4, je + 3) must 
be complex conjugates, (5.3), (6.2). Thus for all pairs of values taken by 
ji, jz, we must have 


—f(ji , jdkGi — 3,32 — 3) £0 ‘in 
g(j1, Ja)h(ji + 3,52 — 4) £ O 


Then by altering the phases, we can make 


weer, Gta Tee Puasa states) 
Id iW 2ji(2jr + 1)2j2(2j2 + 1) ‘ 


ay /f_ (are — Pi het+ P+ (A-h—-DA-AtItY 
ined / { (Qjr + 1)(Qje + 2)2ja(Qje + 1) ) 


h(ji, je * _(a-p-—p—-V(A-pt+p(A—Fe-—9—VD(A—-e+O 
7 V/ { 5. + Dp + DQ +D 


(aA tj —pt1(jat+ze+ pt 2) 
Min) = e4/ 











bs 


(Ate—-G+VD(Ate+at 2), 








(2j1 + 1)(2j1 + 2)(2j2 + 1)(2j2 + 2) 
where the radicals are taken real and positive or zero. 
Either I only integral values of ji + je, and therefore also of 71 — je, are 
involved, or II only half odd integral values, (6.3). 
[. For factors in the numerators to vanish (7.5), at least one of p, gq, —p — 1, 
or —q — 1, must be a positive integer or zero; since interchange of p and q, 
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or of p and —p — | does not affect (9.2), we may suppose p < 0 is an integer, 
Then (¢ + 4)? must be real for the expressions under the radical signs to be real, 
IA. Suppose q¢ is not an integer. Since a zero factor in the numerator of 
f(j:, jz) must occur at the lower limit of 7; + je which must be not less than 
zero, j:1 + je must take in f(j: , j2) the values p + 1, p + 2, --- , and likewise 
in k(j,, je) the values p, p + 1, --- , which must make the radicals real and 
positive. 
Thus 


(p + 32 > q + 42. 9.3 


ji + je then takes in g(j1, jo) and h(ji, je) the values p, p + 1, ---. 

Since a zero factor in the numerator of g(j:, j2) must oecur, for any allowed 
value of j; + jo, in particular j; + je = p, at lower and upper limits of 7; — j, 
which must make both 7; and j2 positive or zero, 71 — je must take in g(j1 , js) 
the values —p, —p + 1,---,p — 1, and likewise in h(j;, je) the values 
—p + 1, —p + 2,---,p, and in f(ji, je) and k(ji, je) the values —p, 
—p +1,... , Dp. 

For p > 0, therefore, j,; — j2 = 0 being a possible value in g(ji , j2) or A(ji , je), 

(2)? > (q + 3)’, 9.41 
which in fact implies (9.3). 


For the special case p = 0, this last condition is not necessary, since g(j: , j2) 
and h(ji , j2) do not take any values, and we have from (9.3) merely 


(3)? > (q + 3). 9.42 


The only integral value of g within these limits is g = 0 for p = 0. 

IB. Suppose q is an integer, which, on account of the interchangeability of ¢ 
and —q — 1 and of p and q may be taken to be such that ¢q ¢ p < 0. 

For g = p, the argument of (IA) applies, and the only possible values are 
p = 0, q = 0, which can be regarded as a special case of (IA). 

For g > p > 0, since j; + je can take only values which make the radicals 
in f(ji , j2) and k(j1 , jz) real, 71 + jo must take the values g + 1, q + 2, ---, in 
fn ’ jz) and q;, q + ., 7 ie in k(ji ’ j2), and so q, q + 1, me in g(ji ’ j2) and 
h(ji , jz); and since 7; — j2 can only take values which make the radicals in 
g(ji ,J2) and A(j, , je) real, 71 — je must take: either the values —q, —¢+ 1, ---, 
— 2, in 9(ji, J2), —=—_ + 1, = + 2, Ces ee e in h(ji, 32), and “| 
—qt+1,---,—p—1,inf(ji , je) and k(j: , j2); or the values p + 1, p + 2, ---, 
— 1, in g(ji , je), P +2,p+ 3, hie ,q, in h(ji , je), and p + l,pt+ 2, ute 
in f(ji , j2) and k(j, , j2): these results hold even for g — 1 = p > O, when there 
are no values for g(ji , je) and A(ji , je). 

For g > p = 0, the above two representations still exist with the expected 
limiting forms when g — 1 = p = 0: the limiting cases of equality in (9.41) 
above are reducible to the sum of these two representations. In addition a 
solution is possible with 7: — jo = 0 in f(ji , je) and k(j1 , 72) and no values for 
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g(jx, jz) and A(ji, J); then j; + je takes the values gq + 1, g + 2,---,in 
fly, je) and q, q i i, —— in g(ji ; je). For Maas 0, . = 1, this gives the 
limiting case of equality in (9.42) above. 

Rearranging these results to correspond to (p + 3)? < (q + 4), and de- 
scribing the representations by the values which j; + j2 and j: — jz take in 8, ’, 


we have:— 
For 
—0 < (q + 3)? < (), 


(p + 3)? = (2), (3), (D%,--- 
one representation each with 
Atj2=p,pt+i,pt+2,---, 
—pP, -—pt+l,---,p. 


ji ja 
For 
(q + 3)? = (@)’, 
(3)? > (p + 3)? < (3), 
one representation each with 
jit je = 0,1, 2,---, 
ji — je = 9. 
For 
(q + 3)? = G)’, 
(p + 3)? = (3), ()*, @*, --- 
three representations each with 


Atj=p,pti,pt+2,--:-, 


and with 
ai-j= —?P, —p + bone, =H, 
or 
ji— je = 0, 
or 
ai-ph= 1, 2,---,D. 
For 


(q + 3)? _ (3), ( §)?, ($)?, cits 
(p + 3)? = ()*, @*, @*,---, > @+ 2), 
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two representations each with 
Ati =pptilpt+2,--- 
and with 


ji-je= —P, —o $1, +--+; ~g = I, 


A—-je=Qt1,qt+2,---,p. 
II. When only half odd integral values of 7; + j2 and therefore also of j; — j, 
are involved, the same arguments as in (I) apply, but there are no special cases, 


and we have:— 
For 


—- <i t+ YF > @, 
(p + 3)? = 1, 2, 3, ---, 
one representation each with 
Atj=pptil,pt+2,---, 
ji—je= —p, —p tl, --*yp. 


(q + 3)? ¥, 2’, 3, re 
(p + 2)? 2? 3, ¥, eae a (q + 3)’, 
two representations each with 


At+j2 =p,p+1,p+2,--- 
and with 


ji — je —?P, —p + I, ---:, wg ond’ 


a-jk=at+1qt+2,---,p. 


None of these representations are matrices of finite order. If it is not required 
that the representation be Hermitian, there are many further possibilities, 
including, both for integral values of p and q and for half odd integral values, 
finite sets where j; + j2 takes the values q,q + 1, --- , p — 1, and j, — je takes 
the values —q, —q + 1, --- ,q; which are indeed complex transformations of 
the Hermitian representations of the operators of the rotation group in five 
dimensions, from which the algebraic form of (7.41) and (7.42) could have been 
derived. 

Case (I) above corresponds to one-valued, case (II) to two-valued repre- 
sentations in the De Sitter space. 
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DER DREIERSTOSS 
Von Cart Lupwie SIEGEL 


(Received June 27, 1940) 


EINLEITUNG 


Es seien A; , Ag, Az drei Massenpunkte im Raum, die sich nach dem New- 
tonschen Gravitationsgesetz anziehen. Es seien x; , y., 2% die rechtwinkligen 
kartesischen Koordinaten des Punktes A; (kK = 1, 2, 3) und m, seine Masse; 
ferner seien die Abstfnde A2A3, A3A1, AiA2 mit ™, 72, 73 bezeichnet. Setzt 


man 
(1) 


und versteht unter w;, eine der drei Koordinaten 2; , y; , 2 , so lauten die Dif- 
ferentialgleichungen des Dreikérperproblems 





m m 
th ws Me™ + msm + 1™M2 
‘1 Te T3 


dw, ou , 
(2) ms = (we = Xe, Ye, 2%} & = 1, 2,3). 
Man gebe zur Zeit ¢ = é irgend welche endlichen Anfangswerte der 9 


Koordinaten w; , fiir welche die Abstinde 7; , re, 73 simtlich grésser als 0 sind, 
und ausserdem beliebige endliche Anfangswerte der 9 Geschwindigkeitskom- 


dw, ” e ; 
ponenten ——. Nach einem bekannten Existenzsatz aus der Theorie der 


dt 

Differentialgleichungen sind dann die durch jene Anfangswerte bestimmten 
Lésungen des Systemes (2) in der Umgebung von ¢ =  regulire analytische 
Funktionen von ¢t. Wir denken uns die Lésungen lings der reellen Achse von 
t = t aus nach beiden Seiten analytisch fortgesetzt und nehmen an, dass wir 
auf diese Weise zu einem singulaéren Punkt ¢ = t einer der Funktionen wu, 
gelangen. Da die Differentialgleichungen (2) in sich tibergehen, wenn zu ¢ 
eine beliebige Konstante addiert wird und wenn ¢ durch —¢ ersetzt wird, so 
kann man t, = 0, t& > O voraussetzen. 

Wie Sundman’ gezeigt hat, bestehen bei dem Grenziibergang t — 0, t > 0 
nur die folgenden beiden Modglichkeiten: Entweder strebt genau eine der drei 
positiven Zahlen r; , re , r3 gegen 0, wahrend die beiden andern einen positiven 
Grenzwert haben, oder aber sie streben alle drei gegen 0. Wir wollen diese 
beiden Fille weiterhin als Zweierstoss und Dreierstoss bezeichnen. Der Zweier- 
stoss wurde zuerst fiir das restringierte Dreikérperproblem von Levi-Civita’ und 





'K. F. Sundman, Recherches sur le prabléme des trois corps, Acta Societatis Scientiarum 
Fennicae, Bd. 34 (1907), Nr. 6. 
_* T. Levi-Civita, Traiettorie singolari ed urti nel problema ristretto dei tre corpi, Annali 
di matematica pura ed applicata, Ser. III*, Bd. 9 (1904), S. 1-32. 
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dann allgemein von Bisconcini’ behandelt. Sundman fiihrte diese Untersuchun-- 
gen weiter und zeigte insbesondere, dass beim Zweierstoss der Punkt ¢ = 0 ein 
algebraischer Verzweigungspunkt zweiter Ordnung fiir die Lésungen ist und 
dass die Koordinaten wu; in der Umgebung dieses Punktes regulare Funktionen 
der Ortsuniformisierenden ¢' sind. 

Auch der Dreierstoss wurde von Sundman eingehend untersucht. Er 
bewies, dass beim Dreierstoss die Ausdriicke rt ?* (k = 1, 2,3) firt0 positive 
irenzwerte 7; haben, fiir welche entweder 7; = f2 = 73 oder fe = ?3 + 7; oder 
Tz = 1 + fe oder 7, = 7 + 73 gilt. Da die drei letzten dieser 4 Faille durch 
zyklische Vertauschung der Indizes 1, 2, 3 ineinander tibergefiihrt werden kén- 
nen, so wollen wir uns im folgenden nur noch mit dem ersten und zweiten 
beschaftigen. Zeichnet man ein Dreieck aus den Seiten 7; , 72, 73, so erhilt 
man im ersten Fall ein gleichseitiges Dreieck, im zweiten Fall drei Punkte einer 
Geraden; wir wollen daher von dem gleichseitigen und dem geradlinigen Fall 
sprechen. Ferner zeigte Sundman, dass bei jeder Dreierstossbahn die Ebene 
des Dreiecks A;A2A;3 fiir variables ¢ nur eine Parallelverschiebung mit konstanter 
Geschwindigkeit erleidet. Da bekanntlich die Differentialgleichungen des 
Dreikérperproblems ungedndert bleiben, wenn man ein beliebiges neues recht- 
winkliges Koordinatensystem einfiihrt, das einer Translation mit konstanter 
Geschwindigkeit oder einer Drehung um konstante Winkel unterworfen wird, 
so braucht man fiir die weitere Untersuchung des Dreierstosses nur noch das 
ebene Dreikérperproblem zu behandeln und kann x% = O (k = 1, 2, 3) 
voraussetzen. 

Aus den Resultaten Sundmans folgt sofort, dass beim Dreierstoss die drei 
Winkel des Dreiecks A,A24A; fiir t — 0 Grenzwerte haben; im gleichseitigen Fall 
streben sie nimlich simtlich gegen 37 und im geradlinigen Fall strebt der Winkel 
bei Az gegen 7, wahrend die beiden anderen Winkel gegen 0 streben. Es war 
aber bisher nicht bekannt, ob auch die Winkel der Dreiecksseiten mit den 
Koordinatenachsen fiir t — 0 Grenzwerte haben, d. h. ob die drei K6érper in 
bestimmten Richtungen zusammenstossen. Diese Frage wird im folgenden 
bejahend beantwortet werden. In engem Zusammenhang mit der Entscheidung 
dieser Frage steht das Problem der Entwickelbarkeit der Koordinaten der 
kollidierenden Massenpunkte in irregulire Potenzreihen der Variabeln ¢. Wir 
werden solche Reihenentwicklungen wirklich aufstellen und damit zugleich 
beweisen,dass beim Dreierstoss der Punkt ¢ = 0 im allgemeinen ein logarith- 
mischer Verzweigungspunkt fiir die Lésungen ist. Dieses Ergebnis bezeichnet 
einen wesentlichen Unterschied gegeniiber dem Zweierstoss. Die Reihenent- 
wicklungen werden uns endlich einen vollen Uberblick iiber simtliche Dreier- 
stossbahnen in der Nihe von t = 0 verschaffen. Sieht man zwei Lésungen 
nicht als verschieden an, wenn sie durch eine Drehung des Koordinatensystems 
um konstante Winkel oder durch eine Translation mit konstanter Geschwindig- 
keit ineinander tibergefiihrt werden kénnen, so zeigt es sich, dass in der Nahe 





*G. Bisconcini, Sur le probléme des trois corps, Acta Mathematica, Bd. 30 (1906), 8. 
49-92. 
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yon t = 0 die simtlichen Dreierstossbahnen im gleichseitigen Fall von 3 Para- 
metern analytisch abhangen und im geradlinigen Fall von 2 Parametern. 

Wir wollen das weiterhin abzuleitende hauptsichliche Resultat unserer Un- 
tersuchung noch priazis formulieren. Zur Abkirzung werde gesetzt 


_ M2ms + mgm + Mm 
(m; + m2 + ms)? 





und 
_ m{1 + (1 — w+ -4)*} +m +o7 +o 
m + melo? + (1 — w)*} + ms 


wo w die im Intervall 0 < w < 1 gelegene Lésung der Gleichung 


m{(1 —w)” — (1 — w)} + mefwo(1 — w)”* — (1 — ww *} + mew —w’) = 0 





b 


’ 


bedeutet; ferner sei 
a, = 3{-1 + [13 + 12 (1 — 3a)*J}}, ag = 3{-1 4+ [13 — 12 (1 — 3a)*}}}, 
bi = §{-1 + (25 + 16b)'}. 
Ist dann weder = noch 3 eine ganze Zahl, so lassen sich simtliche Koordinaten 
a2 
a, yx (k = 1, 2, 3) im gleichseitigen Falle des Dreierstosses in der Form 
te= Ere (ui, ue, Us), ye = Oye (ur, Ue, Us) (k = 1, 2, 3) 
ausdriicken, Wo 2g (U; , Uz, Us) und yg (U4, U2, Us) Potenzreihen in den Gréssen 
2/3 


uy= ayt ) ue = aot", U3 = ast? 


mit konstanten a;, a2, a3 bedeuten; dabei hingen die Koeffizienten dieser 
Potenzreihen nur von m,, m, m3 ab. Die Werte a, a2, as sind eindeutig 
durch die Dreierstossbahn bestimmt, und umgekehrt liefert jedes System von 
reellen Werten a, a2, a3 Wieder eine Dreierstossbahn, fiir welche der gleich- 


seitige Fall vorliegt. Ist ferner keine ganze Zahl, so haben die Koordinaten 


im geradlinigen Fall des Dreierstosses die Form 


2/3 _* 2/3. * as 
ty, = Cty (01, v2), Yr = CY (V1 , V2) (k = 1, 2, 3), 
— * . . 
WO (v1 , v2) und y (v1 , v2) Potenzreihen in 
2/3 b 
1 = Bil’, ve = Bet" 


mit konstanten durch die Bahn eindeutig bestimmten Werten 61, 62 bedeuten; 
umgekehrt ergibt jedes reelle System #; , 6: eine Dreierstossbahn fiir den gerad- 





linigen Fall. Ist = eine ganze Zahl g, so bleiben die obigen Aussagen bestehen, 
a2 


wenn man darin wu; durch die Formel 


uy = (ay + C103 log t) 
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erklart, wo c; eine gewisse nur von m , Mm, m3 abhangige Konstante bedeutet. 


r) ° . a, 
Im Falle eines ganzzahligen — = A hat man entsprechend 
a2 


Uu = t"' (ae 4- C20’ log t) 


zu erklaren, und fiir ganzzahliges - = 7 ist 


ve = t'(Bo + e983 log t) 


zu setzen, wobei cz und cz wieder nur von m,, mz , ms abhangen. 

Es sei noch daran erinnert, dass wir im geradlinigen Falle zwei weitere von 
zwei Parametern abhingige Scharen von Dreierstossbahnen erhalten, wenn wir 
m,, M2, m3 zyklisch vertauschen. Ausserdem kann man noch eine beliebige 
Drehung des Koordinatensystems um konstante Winkel vornehmen, sowie eine 
beliebige Translation mit konstanter Geschwindigkeit. 

In den ersten 6 Paragraphen der vorliegenden Arbeit werden im wesentlichen 
die Ergebnisse, welche Sundman fiir den Dreierstoss erhalten hat, in etwas 
verinderter Art hergeleitet. In §7 werden spezielle Dreierstossbahnen, die in 
den bekannten Lagrangeschen partikuléren Lésungen des Dreikérperproblems 
enthalten sind, kurz besprochen. Als weitere Vorbereitung wird in §8 unter 
Anwendung der Jacobi-Hamiltonschen Theorie die Transformation der Dif- 
ferentialgleichungen des Dreikérperproblems behandelt, welche man in der 
Astronomie als Elimination der Knoten bezeichnet.* Es wird dann die Bestim- 
mung aller Dreierstossbahnen zurtickgefiihrt auf die Lésung folgender Aufgabe: 

Es sei 

db; 

(3) = hs (k = 

ein System von n Differentialgleichungen erster Ordnung, dessen rechte Seiten 
Potenzreihen der unbekannten Funktionen 6,,---, 6, sind, aber nicht die 
unabhangige Variable s explizit enthalten. Die Reihen f, mégen keine kon- 
stanten Glieder haben, und es sei ay; der Koeffizient von 6; (lJ = 1, --- , m) in 
den linearen Gliedern von f, (k = 1,---, ”). Von den charakteristischen 
Wurzeln der Matrix (a,;) mégen genau p einen negativen Realteil haben und 
keine den Realteil 0. Man gebe simtliche Lésungen des Systemes (3) an, 
welche fiir s — «© den Grenzwert 0 haben. 

Diese Aufgabe ist unter noch allgemeineren Voraussetzungen tiber die Funk- 
tionen f, von Bohl’ bearbeitet worden. Er ‘zeigte auf sehr geistreiche Weise, 
dass jene Lésungen von p Parametern abhangen. Die von Bohl verwendeten 
topologischen Hilfssaitze beruhen aber wesentlich auf indirekten Schliissen und 
k6nnen nicht ohne weiteres zu einer Konstruktion der Lésungen benutzt werden. 





* Vergl. S. 339-341 in E. T. Whittaker, A treatise on the analytical dynamics of particles 
and rigid bodies, with an introduction to the problem of three bodies, Cambridge (1904). 

5 P. Bohl, Sur certaines équations différentielles d’un type général utilisables en mécanique, 
Bulletin de la société mathématique de France, Bd. 38 (1910), S. 5-138. 
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Deshalb wird im folgenden in den Paragraphen 11 und 12 das Problem auf eine 
andere Art behandelt werden, durch Benutzung gewisser Reihentransforma- 
tionen, welche die gesuchten Lésungen in expliziter Gestalt ergeben. Die 
irregularen Potenzreihen, durch welche wir die Lésungen ausdriicken werden, 
treten bereits in Poincaré’s Untersuchungen tiber asymptotische Bahnkurven 
auf; dort fehlt aber gerade der Nachweis, dass diese Reihen wirklich alle asym- 
ptotischen Bahnen darstellen. Die Art, in der wir die Cauchysche Majoranten- 
methode anwenden, weicht von der tiblichen ein wenig ab und bietet vielleicht 
auch ein selbstindiges Interesse, da sie fast ohne Rechnung zum Ziele fiihrt. 

Im letzten Paragraphen werden schliesslich unsere Resultate iiber das System 
(3) angewendet auf die geeignet transformierten Differentialgleichungen des 
Dreikérperproblemes und ergeben den oben ausgesprochenen Satz iiber die 
Mannigfaltigkeit der Dreierstossbahnen. 


1. ALGEBRAISCHE VORBEREITUNGEN 


Ist F eine Funktion der Zeit t, so soll die erste und die zweite Ableitung von 
F nach t in tiblicher Weise mit F und F bezeichnet werden. Unter, --- , y17 
wollen wir weiterhin Gréssen verstehen, die langs der zu betrachtenden Bahn- 
kurve des Dreikérperproblems konstant sind. Zur Abkiirzung werde noch 


(4) m = m + m2 + ms; 


gesetzt. 

Wir schreiben zunachst die bekannten algebraischen Integrale des Drei- 
kérperproblems auf, naimlich die Flachenintegrale, die Schwerpunktsintegrale 
und das Energieintegral. Die Flachenintegrale lauten 


3 3 
z: MAYRe — ZY) = V1, . mi(Zet~ — Leen) = Y2, 
(5) k=l k=1 


3 
>» mi(Le Ye — Yeu) = Ys. 
k=1 


Die Schwerpunktsintegrale besagen, dass der Schwerpunkt der drei Massen- 
punkte sich geradlinig und gleichférmig bewegt, nimlich 


id i< 
— >) mete = at + 75, ~ Y meyn = et + 7, 
Mm k=1 ™M k=1 


1 3 
— Yo mz = yet + V0. 
™M k=1 


Da die Differentialgleichungen (2) sich nicht andern, wenn a; , ye, 2 dureh 
Le + yat + ¥5, Ye + vot + 77, 2 + vst + v9 (k = 1, 2, 3) ersetzt werden, so 
kann man weiterhin voraussetzen, dass der Schwerpunkt fest im Koordinaten- 
anfangspunkt liegt, dass also die Gleichungen 

3 3 3 
(6) >: m,2x;, = 0, yt my. = 9, > m,z, = 0 

k=1 k=1 k=1 


gelten. Bedeutet 
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3 
(7) T= 420 mite + ye + %) 
k=1 
die lebendige Kraft des Punktsystems, so ist 
(8) i —_ U = V10 


das Energieintegral. 
Wir setzen nun 





















3 
(9) J= > mix + ye + 2%). 


Dann ist 








3 


(10) ij = } ® mite te + YRYe + 22x), 


t=1 
3 3 
i= DV mat+e+ a t+ > mi(tete + Yet + 2%); 


k=1 






also nach (2) und (7) 


3 
(11) W =r t O(n +e aU 2 90), 


OX OZ 


Da nun U eine homogene Funktion der Dimension —1 in den 9 Variabeln 


21, ---, 23 ist, so ist nach einem bekannten Eulerschen Satze 
3 
0U - dU 
& (x bo + Yi a “Sai 2k Ay = —U 
Lk Oz 


und (11) ergibt die Lagrangesche Formel 
ij = 2T — U, 





also nach (8) 


(12) iJ =T+ Y10 ;» 
(13) 1f =U+2Qy. 
Sind B,, --- , B, und C,, --- , C, zwei Reihen von je n Gréssen und bezeich- 


net man mit 
Dyq = B,C, — BC» (lsp<qs7”) 


die n(n — 1)/2 zweireihigen Unterdeterminanten der aus diesen beiden Reihen 
gebildeten Matrix, so gilt identisch 


n n n 2 
(14) > Bs b & Cs 7 (x B,C) ov ps x. 
p=1 p=1 p=1 Pq 


Wir wahlen n = 9 und identifizieren die Paare B,, Cp (p = 1, --- , 9) mit 
wer my , Wir/m, (We = Te, Ye, % jk = 1, 2,3). Behalt man die Abkiirzung 





a a ~~ mmr ~ > 
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D,. bei, so geht (14) aufolge (7), (9), (10) iiber in 
(15) aTJ — iJ’? = } =... 
P.q 


Also gilt insbesondere die Ungleichung 
(16) J? < 8JT. 
2. Dir SINGULARITATEN DER BAHNKURVE 


Wir wenden auf die Differentialgleichungen des Dreikérperproblems folgenden 
bekannten Existenzsatz an: 


Es seien F;, --- , F, Funktionen von n Variabeln s,, ---, s,, die in der 
Umgebung eines Punktes s; = 01, --- , 8: = o, in Reihen nach Potenzen von 
81 — 01, °** 5 8, — o, entwickelbar sind. Es sei 6 eine positive Zahl, fiir welche 


die Potenzreihen in dem Gebiete 
(17) [se —o| S6,---,|8. —on| S85 


konvergieren; ferner sei K eine gemeinsame obere Schranke der Werte 
|F,|,---, |, | in diesem Gebiete. Es sei ¢ eine reelle Variable und & irgend 
ein Wert von t. Es gibt genau ein System von n Funktionen ¢), --- , g, der 
reellen Variabeln ¢ mit folgenden Eigenschaften : 

1) Die Funktion ¢;(t) ist differentiierbar in einer Umgebung von ¢ = & und 
es ist gx (to) =o(k=1,---, n); 

2) in dieser Umgebung von t = f& geniigen s; = gi(t), --- , 82 = ¢n(t) den 
Differentialgleichungen 





ds;, 
Ks gibt ausserdem eine nur von 6, K und n abhingige positive Zahl 7, sodass 
die Funktionen s, = g;(t) in Reihen nach Potenzen von ¢ — t entwickelbar 
sind, welche samtlich fiir | ¢ — & | < 7 konvergieren und in diesem Gebiete den 
Ungleichungen (17) geniigen. 
Um diesen Satz auf das System (2) anzuwenden, wahlen wir n = 18 und 
setzen fir k = 1, 2, 3 


= F, (k =1,---,n). 


= &, Yx = Sk+3 5 Ze = Sk46, 
t = Sirg = F;,, Ye = Sry2 = Preys, Ze = Sits = Frye, 
1 aU 1 aU 1 aU 
—— =F i ot on. BF — — = Fyrs. 
m, Ox, k+9 ) mM: ae k+12 ; mM, Oz; k+15 
Zur Zeit t = t seien irgend welche endlichen reellen Anfangswerte o1, --- , 718 
der 18 Gréssen x, --- , 23 gegeben, fiir welche keine Kollision vorliegt, also 
die Abstande r,, re, 13 grésser als 0 sind. Wir berechnen aus den Zahlen 
1, --- , oignach (1) und (8) den Anfangswert U = Uo und den Wert der Kon- 


stanten yo des Energieintegrales. Ist dann K, irgend eine obere Schranke fir 














SS ee ee oa 


= 2 
ae 


—e 
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Uy , so gelten nach (1), (7), (8) fiir die Anfangswerte die Ungleichungen 


| Ky a | Ky | Ky 
n<—, mw<m—, F<, 
Me2™M3 ; 







te + ye + 2 S 2(Ki + y0)mz' 









Folglich lassen sich zwei positive Gréssen 6 und K als Funktionen von m,, 
mz , m3, Ky und yw allein so wahlen, dass die Voraussetzungen des Satzes erfiillt 
sind. Die durch die Anfangswerte eindeutig bestimmte Lésung s = ¢;(t) 
(k = 1,---, 18) der Differentialgleichungen ist dann regular in einem Zeit- 
intervall tf) — 7 < t < t + 7, wo die positive Zahl 7 nur von m , me , m3, Ky , yy 
abhangt. 

Wir betrachten jetzt, von t = & ausgehend, die Funktionen g;,(¢) fiir fallende 
reelle Werte von t. Entweder sind sie simtlich regular fiir alle endlichen reellen 
t < ty oder aber es gibt einen endlichen Wert ¢ = t; < & , sodass alle Funktionen 
im links offenen Intervall 4; < ¢ S t regular sind und mindestens eine von 
ihnen fiir ¢ = 4 singular ist. Eine analoge Aussage gilt fiir ¢ 2 t und wachsende 
Werte von ¢. Da die Differentialgleichungen (2) bei der Transformation 
t — —tin sich tibergehen, so kann man sich auf die Untersuchung fiir fallendes 
t < t) beschranken. Weil die Differentialgleichungen die Zeit ¢ nicht explizit 
enthalten, so kann man ausserdem noch 4; = 0 voraussetzen. Wegen der 
Regularitat fiirfs — 7 <t<& + ristdann7r St. NunseiO < & < rund 
U’ = Us. der Wert von U fiirt = t. Ware auch U2 < K,, so waren die Funk- 
tionen g(t) auch samtlich in dem Intervall t — 7 < t < t + 7 regular; aber dies 
ist ein Widerspruch, da das Intervall den singularen Punkt ¢ = 0 enthilt. 
Fiir 0 < ¢ < 7 gilt daher ( = K,, und dabei hangt 7 nur von m,, me, m3, 
yi und K,; ab. Da K, beliebig gross sein kann, so folgt, dass U iiber alle 
Schranken wachst, wenn ¢ zu 0 abnimmt. Dies bedeutet, dass fiir ¢ — 0 der 
kleinste der 3 Abstande 7 , 72, 73 gegen 0 strebt. 

In folgenden bedeuten 7, 72, 73 geeignete hinreichend klein zu wablende 
positive Zahlen, die nur von den Massen m; , m2, m3 und den gegebenen Anfangs- 
werten von 2, --- , Z;abhangen. Auf der betrachteten Bahnkurve ist U — « 
fiir t > 0, also nach (13) 























(18) J>0 (0 <t< 7). 








Daher ist J im Intervall 0 < ¢ S 7, eine konvexe Funktion von ¢ und hat folglich 
fiir ¢ > 0 einen Grenzwert J) , der positiv oder 0 sein kann. Im Falle Jo > 0 
folgt aus (6) und (9), dass der grésste der drei Abstinde r;, re, 73 fiir t > 0 
oberhalb einer positiven Schranke bleibt. Da andererseits der kleinste dieser 
Abstiande gegen 0 strebt und sie simtlich fiir ¢ > 0 stetige Funktionen von ¢ sind, 
so ergibt sich, dass fiir t 0 eine bestimmte der Dreieckssciten r;, gegen 0 strebt, 
wahrend die beiden anderen oberhalb einer positiven Schranke bleiben; es 
stossen dann also zur Zeit ¢ = 0 genau zwei von den Kérpern zusammen. Dieser 
Zweierstoss ist von Sundman vollstandig untersucht worden. Er hat gezeigt. 
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dass die Koordinaten der drei K6rper in der Umgebung von ¢t = 0 reguliare 
Funktionen der Ortsuniformisierenden ¢’”* sind. 

Weiterhin beschaftigen wir uns dauernd mit dem Fall J/g = 0. Dann streben 
aber nach (9) fiir ¢ — 0 alle drei Abstande 7, , 72 , 73 gegen 0 und es stossen zur 
Zeit t = 0 alle drei Kérper im Nullpunkt zusammen. Es soll genauer unter- 
sucht werden, in welcher Weise dieser Dreierstoss vor sich geht. 

3. DAS ASYMPTOTISCHE VERHALTEN VON J UND J 

Nach (18) ist die positive Funktion J konvex im Intervall 0 < ¢ S 7, 
ferner strebt J gegen 0 firt— 0. Folglich gilt 
(19) J>0 0 <t< 7). 
Andererseits ist nach (12) 

inn canis FS uae one i(8J 7 ie Par? ob ye adel 
und hierin ist die linke Seite gerade die Ableitung der Funktion /J~'" nach ¢. 
Integriert man diese Gleichung zwischen den Grenzen ¢ und 7 und bezeichnet 
mit J; und J; die Werte von J und J fiir t = 7,, so folgt 


0) divat— Jot =a fur - J) dt + ae [at 
t 


Wir wollen nun beweisen, dass die beiden Integrale auf der rechten Seite fiir 
t — 0 endliche Grenzwerte haben. Wir verstehen weiterhin unter uw, --- , us 
gewisse positive Zahlen, die geniigend klein zu wahlen sind und nur von m,, 
m:, ms; abhaéngen. Da der Nullpunkt im Schwerpunkt des Dreiecks A,A2A3 
liegt, so ist 


J > mri + 73 + 73) 
und folglich 
U> mJ, 
also auch 
U + 2x0 > usd? (0 <t< 7). 
Im Intervall 0 < ¢ < 72 gilt dann nach (13) und (19) 
(I?) = I > dust I~ 
J? > 8ysJ' 
(21) JI" > us 
| I> ust'”. 


Demnach konvergiert das zweite Integral in (20) bis nach t = 0. Zufolge (19) 
und (20) ist dann das erste Integral in (20) fiir t + 0 nach oben beschrankt; da 
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aber sein Integrand wegen (16) nicht-negativ ist, so konvergiert es ebenfalls 
bis nach t = 0. Nach (20) existiert also 


(22) lim JJ” = V8 s 


t—0 


und zwar ist dieser Grenzwert zufolge (21) eine positive Zahl. Durch Integra- 
tion von JJ“ folgt aus (22) die asymptotische Gleichung 


J mw Byut (t > 0) 
(23) J ~ nt? 
mit 
(24) dh = (3m)*" > 0, 
also nach (22) 
(25) J ~ 4a". 
Damit ist das asymptotische Verhalten von J und J klargelegt. 
4, DAS ASYMPTOTISCHE VERHALTEN VON U unp T 
Setzt man 
(26) (JT — J)" = gid, 
so ist nach (16) 
(27) g(t) 2 0 
und nach §3 das Integral 


| gers * dt 
0 


konvergent. Zufolge (23), (24), (25) konvergiert dann auch 


T1 d 
(28) [ 9%. 
0 t 
Wir wollen beweisen, dass g(t) fiir t + 0 den Grenzwert 0 besitzt. Wegen (27) 


und der Konvergenz des Integrales (28) ist jedenfalls 
(29) lim inf g(é) = 0. 


t—0 
Ware nun 
lim sup g(t) > 0, 
t—0 
so kénnte man wegen (29) und der Stetigkeit von g(t) fir ¢ > 0 eine positive 


Zahl y12 und eine monoton zu 0 abnehmende Folge 7; > t; > t2 > --- so finden. 
dass 








falls 


gra- 


> 0) 
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(30) G(len) = 2, G(tena) = 372 (n = 1, 2,3, ---) 
und in jedem Intervall t, S ¢ S tn-1 die Ungleichung 

(31) ye S g(t) S 37x 

gilt. 


Nach (26) ist 

T= 3d + gQPys™, 
also nach (23), (25), (31) 
(32) T < st“ (tn St S tnijn = 1, 2,3, ---). 
Ferner ist 

3 , oer 
t-U=-> (Sint in Wa), 

also nach (1), (7), (8), (32) 
(33) P| <ul? < yt, 
wieder im Intervall ta, S ¢ S tens (n = 1, 2,3,--- ). Folglich gilt dort auch 


nach (23), (25), (32), (33) die Ungleichung 
| (8JTt") | < ret", 


und die Funktion 8/7t** andert sich daher im Intervall t, < ¢ < tn». um 


weniger als 
‘a dt 
o ton t ; 


In dem gleichen Intervall andert sich ferner die Funktion J/*t*” zufolge (25) 
um weniger als 712 , wenn nur fan; < 73 ist, also fiir alle geniigend grossen Werte 
von n. Nach (26) und (30) erhalt man dann 


ton—1 dt 
212 = g(tena) — glten) < ¥2 + vu | — 


, 
ton t 
also 


ten—1 
/ dt ye 
t t 716 


und nach (31) 





ton—1 
(34) | g(t) : > > 0, 


ton 


fir alle geniigend grossen n. Durch Summation iiber n folgt aber aus (34) ein 
Widerspruch gegen die Konvergenz des Integrales (28). 
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Damit ist bewiesen, dass g(t) fiir t ~ 0 den Grenzwert 0 besitzt. Es ist also 


auch 
(35) sJT — J° = o(t’”) 
und nach (23), (25) 
(36) T ~ ent", 
nach (8) 
(37) U~ ene. 
5. EBENE BEWEGUNG 

Aus (15) und (35) ersieht man, dass jede der 36 in (15) auftretenden zweirei- 

higen Determinanten D,, fiir £ + 0 von kleinerer Gréssenordnung als ¢’” ist. 


Sind w und w irgend zwei der 9 Koordinaten 2, , --- , 23, so gilt also 
(38) wry — wed = o(t'*). 

Nach (5) haben daher die Flaichenkonstanten 7: , yz, y3 alle drei den Wert 0. 
Hieraus ergibt sich nun leicht, dass die Bewegung in einer festen Ebene durch 
den Nullpunkt vor sich geht: 

Die Differentialgleichungen (2) sind invariant gegeniiber einer Drehung des 
Koordinatensystems um konstante Winkel. Deshalb kann man annehmen, 
dass zur Zeit ¢ = t die drei Punkte A; , Az , A3in der Ebene z = Oliegen. Wegen 
vy = 0, y2 = 0 ist dann also 


3 3 
(39) > MY Zk = 0, } x Mi Xk Zk = (0 
k=1 k=1 


fir? = t&. Nach (6) ist ferner 
3 
(40) > mez. = 0. 
k=1 


Liegen nun die drei Massenpunkte zur Zeit ¢ = t nicht auf einer Geraden, so 
ist die aus den drei Zeilen 2 , 42 , 23 } Y1, Y2, Ys 3 1, 1, 1 gebildete Determinante 
von 0 verschieden, und aus (39), (40) folgt das Verschwinden der drei Werte 
21, 22, 23 fiir t = t. Liegen andererseits die drei Punkte zur Zeit t = & auf 
einer Geraden, so kann man durch eine Drehung des Koordinatensystems er- 
reichen, dass diese Gerade die z-Achse wird und ausserdem die Geschwindig- 
keitskomponente 2; fiir f = f verschwindet. Nach (39) und (40) ist dann aber 


(41) 4221 + MeXo2Zo = 0, m2) + M222 = ( 


fir t = &. Da zur Zeit t = & keine Kollision stattfindet, so ist dann 2, ¥ 22, 
und aus (41) folgt wieder das Verschwinden von 2; und 2 fiir f = t). In jedem 
Fall liegen also die Richtungen der Bewegungen der drei Massenpunkte zur 
Zeit t = t ebenfalls in der Ebene z = 0, und aus den Differentialgleichungen 
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(2) folgt nach dem Eindeutigkeitssatz, dass die Bewegung ganz in der Ebene 
z = Oerfolgt. 

Wir kénnen daher weiterhin z, = 0 (k = 1, 2, 3) annehmen und brauchen 
nur noch ebene Bewegungen zu betrachten. 


6. DAS ASYMPTOTISCHE VERHALTEN DER DREIECKSSEITEN 


Bedeutet w eine beliebige der 6 Koordinaten 2; , y, (k = 1, 2, 3), so wollen 
wir die Abktirzung 


—2/3 


(42) w* = wt 


aii * ~2/3 . : ; 
einfihren, also z.B. 2} = at". Ist allgemeiner eine homogene Funktion 


von %1,-+--, Ys, so wollen wir unter &* den Wert verstehen, den man erhilt, 
wenn man in ® alle Variabeln w durch w* ersetzt; z.B. ist also 


* —2/3 y ry2/3 
re = rit”, U* = Ut". 


Nach (23) und (36) gilt die Abschatzung 


(43) w= O0("), w= ot”). 
Hieraus folgt nach (42) 
(44) w* = O11), ow = wt” — 2ot® = Ot). 


Ferner ist nach (38), wenn wy ebenfalls eine Koordinate bedeutet, 

(45) w*wo — wow* = wt 7? (riot 7? — 2uot ”) — wot (wt? — 2wt”) = o(t7’) 
und nach (23), (26) 

(46) J*~y, J* = JO -— gut” = o(t’). 


Aus (44) und (45) erhalt man 
3 

(47) J*0* — dwtd* = > mlae(zew* — w* ae) + ye(ye wd” — wpe) = o(t”’) 
k=1 


und aus (44) und (46) 
(48) J*w* — jw*J* = dw* + oft’). 


Aus (24), (47), (48) ergibt sich 


(49) w* = o(t’). 
Es ist 
aU* _ wu ti’ 
ow* Ow 


und folglich gehen die Differentialgleichungen (2) durch die Substitution (42) 
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Damit ist bewiesen, dass g(t) fiir 1 + 0 den Grenzwert 0 besitzt. Es ist also 
auch 


(35) sJT — J* = o(t’”) 
und nach (23), (25) 
(36) Twos. 
nach (8) 
(37) U~ ee. 

5. EBENE BEWEGUNG 


Aus (15) und (35) ersieht man, dass jede der 36 in (15) auftretenden zweirei- 
higen Determinanten D,, fiir  —~ 0 von kleinerer Gréssenordnung als ¢”” ist. 
Sind w und w irgend zwei der 9 Koordinaten 2, --- , 23, so gilt also 


(38) wy — wo = o(t'”), 


Nach (5) haben daher die Flichenkonstanten 7: , y2, ys alle drei den Wert 0. 
Hieraus ergibt sich nun leicht, dass die Bewegung in einer festen Ebene durch 
den Nullpunkt vor sich geht: 

Die Differentialgleichungen (2) sind invariant gegeniiber einer Drehung des 
Koordinatensystems um konstante Winkel. Deshalb kann man annehmen, 
dass zur Zeit t = & die drei Punkte A; , A2 , Azin der Ebene z = Oliegen. Wegen 
v1 = 0, ye = 0 ist dann also 


3 3 
(39) z Mr Yr Zk = 0, »» Mi Lie Zk = 0) 


k=1 


flirt = t&. Nach (6) ist ferner 
3 
(40) dy mez = 0. 
k=1 


Liegen nun die drei Massenpunkte zur Zeit ¢ = t nicht auf einer Geraden, so 
ist die aus den drei Zeilen 21 , 2 , 23 ; Y1, Y2, Ys 3 1, 1, 1 gebildete Determinante 
von 0 verschieden, und aus (39), (40) folgt das Verschwinden der drei Werte 
21, 22, 23 fiir ¢ = t&. Liegen andererseits die drei Punkte zur Zeit t = & auf 
einer Geraden, so kann man durch eine Drehung des Koordinatensystems er- 
reichen, dass diese Gerade die x-Achse wird und ausserdem die Geschwindig- 
keitskomponente 2; fiir f = t) verschwindet. Nach (39) und (40) ist dann aber 


(41) M4212) + Meo2 220 = 0, M42, + M222 =—_ 


fir? = %&. Da zur Zeit t = & keine Kollision stattfindet, so ist dann 21 # 2, 
und aus (41) folgt wieder das Verschwinden von 2, und Z fiir t = t). In jedem 
Fall liegen also die Richtungen der Bewegungen der drei Massenpunkte zur 
Zeit t = t ebenfalls in der Ebene z = 0, und aus den Differentialgleichungen 








also 


irei- 
ist. 


rch 


yen 


roy 


ur 
en 





DER DREIERSTOSS 139 


(2) folgt nach dem Eindeutigkeitssatz, dass die Bewegung ganz in der Ebene 


z = O erfolgt. 
Wir kénnen daher weiterhin z = O (k = 1, 2, 3) annehmen und brauchen 


nur noch ebene Bewegungen zu betrachten. 


6. DAS ASYMPTOTISCHE VERHALTEN DER DREIECKSSEITEN 
Bedeutet w eine beliebige der 6 Koordinaten 2; , y, (k = 1, 2, 3), so wollen 
wir die Abktirzung 
(42) w* = wt 


— *K —2/3 : . ° 
einfiihren, also z.B. x) = at Ist allgemeiner ® eine homogene Funktion 


von 21, ---, Ys, So wollen wir unter &* den Wert verstehen, den man erhilt, 
wenn man in ® alle Variabeln w durch w* ersetzt; z.B. ist also 


* —2/3 y 72/3 
re = rit”, U* = UF". 


Nach (23) und (36) gilt die Abschatzung 


(43) w=O0(0"), w= Ot”). 
Hieraus folgt nach (42) 
(44) w* = O11), ww = wt? — 2wt””® = OC’). 


Ferner ist nach (38), wenn wy» ebenfalls eine Koordinate bedeutet, 


(45) wre — wow* = wt 8 (wot °? — 2wot °*) — wot ” 5(wt?* — 2wt*) = o(t™) 
und nach (23), (26) 
(46) J*~yr, Jt = Jt -— 40 = oft’). 


Aus (44) und (45) erhalt man 


3 
(47) J*w* — dw*J* = y» mlx (rew* — wae) + ye (ye — w"ye)} = oft) 
=1 


und aus (44) und (46) 
(48) J*w* — 4w*J/* = dw* + o(t”’). 


Aus (24), (47), (48) ergibt sich 


(49) w* = o(t'). 
Es ist 
aU" _ aU wn 
ow* ow 


und folglich gehen die Differentialgleichungen (2) durch die Substitution (42) 
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liber in 


< _ sl ce 


my (Wy a (we = Le, Ye; k = 1, 2,3) 
k 


Ald). pie ie i aU* 


(50) — Swe + (ve t —;- 
mM: Ow; 


Nun sei e > 0,¢€— 0. Im Intervall « S ¢ S 2e gilt nach (49) 


(51) wt = (w*)ime + of *... (w*) re + o(1). 


Ferner ist nach (37) 
U* ~ $; 
also sind fiir t — 0 die reziproken Werte der Gréssen Th (k = 1, 2, 3) beschrankt 


und das gleiche gilt dann fiir die zweiten partiellen Ableitungen von U* nach 
seinen Variabeln w*. Nach dem Mittelwertsatz folgt aus (51) demnach 


aU* aU* 
ea, Me eee Sts 2). 
(52) at = (3) + oft (Sts 2 
Ausserdem ist nach (49) 


(53) | (wrt'*)'?” dt = [w*?]i — 2 | w*t dt = o(t) (ex 


Wir integrieren jetzt die Gleichung (50) zwischen den Grenzen e und 2¢ und 
ersetzen nachtraglich wieder e durch t. Nach (51), (52), (53) folgt dann 
1 9aU* 


— 2uzt + o(f) = — —{ t + off) 
mM, OW; 


(54) nea = ne (0). 

My OW; 
Diese Relation ist offenbar invariant bei beliebiger orthogonaler Transformation 
des Koordinatensystems. Wir wihlen ein derartiges bewegliches Koordinaten- 
system durch den Schwerpunkt, dass die Strecke A3A; parallel zur Abszissenachse 
ist und A: eine nicht-negative Ordinate hat. Sind X;,, Y; die neuen Koordi- 
naten von A; (k = 1, 2, 3), so ist also X; > X3, Y1 = Y3, Yo = 0. Setzt man 
noch 


(55) yA = Xi = X3, Pe = Xe me X3, P3 = Y2 —_ Y3, 


so haben im neuen System die Punkte A; und Ag in bezug auf A; die Relativ- 
koordinaten p; , 0 und p., ps. Es sei wieder 


x ae xe. Y; = re. De _ pit. 





_ 2, 3) 


‘ankt 
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Benutzt man (54) mit Y: , Y3 anstelle von w; , so folgt durch Subtraktion 


at? 1 
(56) Ps \ #3 — #8 — 0. 


Verwendet man (54) analog fiir bey ; X und fir X? , Xf , so erhalt man 





* 4 * * 
(57) 2p: + m(® — oat *) — (m, + ms) 3 — 0 
’3 rT Te 
" 2 wai ve * * 
(58) pe +m ("3 oa F) = (m2 + ms) =. —@. 
rs Ye r} 


Nach (23) sind die Werte pe (k = 1, 2, 3) beschrainkt; nach (37) sind auch die 
reziproken Werte der rg beschrankt. Wir betrachten jetzt irgend eine Folge 
t — 0, fiir welche die zugehérigen Werte der Pe gegen Grenzwerte p, streben. 
Es seien 7; (k = 1, 2, 3) die Grenzwerte der Gréssen rt. Da der Schwerpunkt 
des Dreiecks im Nullpunkt liegt, so streben auch die Punkte (X; , Yr) gegen 
gewisse Grenzpunkte A; (k = 1, 2,3). Zufolge (56) gilt 
p(fs* — fr") = 0, 

also entweder p3; = 0 oder 72 = 73. 

Liegen die drei Punkte A,, A, As; nicht auf einer Geraden, so ist p; ¥ 0, 


also 7, = 73, und durch zyklische Vertauschung folgt auch 7; = 7; ; daher ist 
das Dreieck A,A:A; dann gleichseitig. Bedeutet r die Dreiecksseite, so ist 


offenbar 

(59) A=1, fe=4r, ps = rv, 
also nach (4) und (57) 

(60) r= m, 


ferner nach (37) 
(61) $\ = (moms + msm + mym)r. 

Liegen A, , A, , A; auf einer Geraden, so kann man nach etwaiger zyklischer 
Vertauschung der Indizes voraussetzen, dass A; zwischen A, und A; gelegen ist. 
Setzt man 





(62) Pi=p, po = op, 

so ist also 

(63) 0<w<il 

und aus (57), (58) folgen die Gleichungen 

(64) $e = m + ms + mio + (1 — w) *} 


(65) 2wp' = mil — (1 — w) >} + (me + ms)o. 
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Daher geniigt w der algebraischen Gleichung fiinften Grades 
(66) mi{(1—w) * — (1 —w)} + mefo(1 —w) * — (1 —w)w *} + ms —w*) = 0. 
Schreibt man sie in der Form 


m + mw _ ms + ma(1 = w) 
m + mw? m3 + m(1 — w)?’ 








(67) 


so ist leicht zu sehen, dass sie genau eine Wurzel im Intervall (63) besitzt. Laisst 
man nimlich w von 0 bis 1 wandern, so wichst die linke Seite von (67) monoton 
von 0 bis 1 und die rechte Seite fallt monoton von 1 bis0. Hat man w bestimmt, 
so erhalt man p vermége (64) und dann j; , pe aus (62), wahrend pz = 0 ist. 
Aus (37) folgt jetzt 


(68) 2. = {memsw | + msm, + mym(1 — w) }p-. 


Man erhdlt die beiden anderen geradlinigen Falle, wenn man in (64) und (66) 
die Massen m , mz , m3 zyklisch vertauscht. 

Aus (59), (60), (62), (64), (66) ist nun ersichtlich, dass sowohl im gleichseitigen 
Fall als auch in den drei geradlinigen Fallen die Gréssen p; , pe, ps3 eindeutig 
durch m, mz, ms; bestimmt sind. Wir hatten bisher eine solche Folge t — 0 
betrachtet, fiir welche die Werte pe (k = 1, 2, 3) konvergieren. Da aber die 
pe fiir t > 0 stetige Funktionen von ¢ sind und nur jene vier isolierten Systeme 
von Haufungswerten jp; , f2 , ~; méglich sind, so konvergieren die py auch, wenn 
t beliebig gegen 0 strebt. Damit ist bewiesen, dass die Ausdriicke r,t" (k = 
1, 2, 3) fiir ¢ — 0 gegen positive Grenzwerte streben, namlich entweder gegen 
den durch (60) festgelegten Wert r des gleichseitigen Falles oder gegen die 
Werte wp, p, (1 — w)p der drei geradlinigen Fille, welche sich aus (64), (66) 
bestimmen, nach etwaiger zyklischer Vertauschung der Indizes. 

Wir wollen weiterhin von den 3 geradlinigen Fallen nur noch den durch 
(64), (66) fixierten studieren, da die beiden anderen durch Vertauschung der 
Indizes auf diesen zuriickgefiihrt werden. 


7. Ern SPEZIALFALL 


Bei den bekannten von Lagrange entdeckten speziellen Lésungen des Drei- 
kérperproblemes bewegen sich A;, Az, Az auf drei in einer Ebene gelegenen 
Kegelschnitten, wihrend das Dreieck A,A2A3 dauernd einem festen Dreieck 
ihnlich bleibt. Dabei ergeben sich fiir die Form des Dreiecks zwei Méglich- 
keiten: Entweder bilden A; , Az , A; die Ecken eines gleichseitigen Dreiecks oder 
sie liegen auf einer Geraden. Setzt man im letzteren Fall voraus, dass A» 
zwischen A, und A; gelegen ist, so erhalt man als Wert des Verhaltnisses der 
Strecken A2A3; und A,A; gerade die durch (66) definierte Zahl w. Hierdurch 
wird nahe gelegt, die unter den Lagrangeschen Loésungen enthaltenen Dreier- 
stossbahnen aufzusuchen, um dann die allgemeinen Dreierstossbahnen mit diesen 
vergleichen zu kénnen. 








= (). 
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on 
nt, 


6) 
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Man erhalt die Kollisionsbahnen unter den Lagrangeschen Lésungen, indem 
man die kegelschnittférmige Bewegung der Massenpunkte in eine geradlinige 
ausarten lisst. Dementsprechend machen wir den speziellen Ansatz 


m= gt), y= ngd (k = 1, 2,3) 
mit konstanten < , y und einer zweimal differentiierbaren Funktion g(t), die 
fir t > O positiv ist und fiir t = 0 verschwindet. Versteht man unter UO den 
Wert von U mit £. , % anstelle von x; , y; , so gehen die Differentialgleichungen 
(2) iiber in 

. .2 _ 00 ee 
(69) metdigg = — (tie = dn, Ju; k = 1, 2,3). 
Ow: 
Folglich ist der Ausdruck gg konstant, aber nicht 0, weil 
3 
» (4 0 + 20) -0#0 


k=1 ? Of; OY; 








ist. Dann kann man aber die Normierung 


(70) ig = -% 

treffen und auf die Gleichungen (69) die Uberlegungen anwenden, mit denen 
wir im vorigen Paragraphen die Relationen (54) untersucht haben. Wegen der 
orthogonalen Invarianz kann man noch “; > #3, J = Ys, J 2 O annehmen 
und erhalt fiir die Punkte (4 , 9) (k = 1, 2, 3) genau die Punkte A, , A: , As 
des gleichseitigen oder des geradlinigen Falles. Fiir die relativen Koordinaten 
&, — £3 = pr, Le — £3 = Po, Yo — Ys = ps gilt dann nach (59) im gleichseitigen 
Falle 


se ps = dr -V/3 


i= 


—r etn” pe sie 
und nach (62) im geradlinigen Falle 
pi = Pp, pe = WP, ps = 0, 


wobei 7, w, p durch (60), (66), (64) festgelegt werden. 
Die Integration von (70) ergibt 


g=t' te 


mit konstantem c. Wahlt man speziell c 
malige Integration 


erhalt man durch noch- 


I 
= 

D 

° 


g — p/% 
und damit als spezielle Dreierstosslésung 
a = 4,f°", ye = wit” (k = 1, 2, 3). 


Fiir einen spateren Zweck berechnen wir noch die zu dieser Lésung gehérigen 
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Werte von %, é2, j2. Nach (6) ist 


n=--™\n—-™ar, &= 
4=(1 m) m 22 L2 
n= (1-™) ir, 


me + 2ms3 1/3 : Ms — ™ _ -1/3 : m+m — 
er / i. = rt! y _ 8 ps 


~ A 3m a are 


also im gleichseitigen Falle 


und im geradlinigen Falle 


2 m2(1 _ w) + ms et -2 m3w — mi(1 = w) ot? 


3m ; 3m » t=O. 








(72) “i= 


8. REDUKTION DER DIFFERENTIALGLEICHUNGEN 


Um das Verhalten der Dreierstosslésungen bei t = 0 noch naher zu unter- 
suchen, miissen wir die in (55) definierten Gréssen p; , pe , p3 in die Differential- 
gleichungen (2) einfiihren. Zunachst bilden wir die Relativkoordinaten von 
A, und A? in bezug auf A; im ruhenden Koordinatensystem 


(73)hi$=u—-%, &8=n—-y, &B=wM-M, KH=¥2—¥s 
und setzen noch 

(74) mei = m, mihi = m, Mote, = 73, Moye = Mm - 
Nach (6) ist dann 

(75) mats; = —(m + 13), msys = —(n2 + ma), 

also nach (7) 


1 1 1 
(76) ‘—_ (ni + 3) + = (ns + ni) + x {(m + 13)” + (m2 + 1)’}. 


Ferner wird 
(77) n=8+8, nm=8H+8, = (&— &)’ + (&— &), 


also U eine Funktion von £, &, &, &allein. Die Differentialgleichungen (2) 
gehen dann iiber in das System achter Ordnung 


( 


‘ 1 1 1 : 
b=(44+2)n+ 3m, b= (242) m+ in, 
M3 Me ™M3 


Mm Ms ms 
; 1 1 1 : 1 1 1 
78 Te oe he me ~~ 5a eo ha 
(78) & (A +2)n+in, &4 (24+2)n+hm, 
dU 


”* =. 
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er- 
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Fiihrt man die Energie 
E=T-U 


ein, so hat das System (78) die kanonische Form 


(79) SJ hn = -% (kK =1,---,4). 
fe 

Um nun die Differentialgleichungen fiir die Relativkoordinaten im bewegten 
Koordinatensystem aus §6 aufzustellen, macht man am bequemsten von der 
Jacobischen Transformationstheorie Gebrauch. Nach dieser gilt bekanntlich 
folgender Satz: 

Es sei H eine Funktion von 2n Variabeln , , px (kK = 1, --- , n) mit stetigen 
partiellen Ableitungen zweiter Ordnung in der Umgebung einer Stelle, an 
welcher die n-reihige Determinante 








aH 
: 4 
(80) Op: On: | 
ist. Dann wird durch den Ansatz 
0H 0H 
(81) = — he = (k = 1, ---,n) 


Onx’ x Ope 


eine Variabelntransformation definiert, welche das Hamiltonsche System 


dE dE 
_ : ~— k = 5 eee ,n 
& am 1k abs ( ) 
in das Hamiltonsche System 
dE dE 
Le, i le, (k = 1, ---,n) 
Pk Ogk qk Op: 


iiberfiihrt. 

Bedeutet p, den Winkel zwischen der ruhenden Abszissenachse und der 
Richtung A3A,, so bestehen zwischen £,, &, &, & und den in (55) erklarten 
Relativkoordinaten p,, 0, pe, ps im bewegten Koordinatensystem die Glei- 
chungen 


—i SIN Ps , £3 = Po COS 4 — Ps SIN Pa, 


£1 = pi COS pu, £2 
& 


(82) 


P2 SiN ps + Pz COS M4. 

Wir wahlen 

H = mp; cos ps + mop sin ps + 73(p2 COS Ps — Ps Sin ps) 

+ ma(pe2 sin ps + ps3 COS pa) 


und wenden den Transformationssatz fiir n = 4 an. Eine leichte Rechnung 
ergibt fiir die Determinante D in (80) den Wert p;, und dieser ist von 0 ver- 
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schieden, solange keine Kollision vorliegt, also fir t > 0. Die erste Gleichung 
in (81) ist wegen (82) erfiillt, die zweite Gleichung ergibt die Formeln 






(83) qm = mcos ps + msin ps, G2 = 93 COS Ps + m SIN Mu, 
ds = —% sin Ps + na COS 74, 
(84) qs = — mpi Sin ps + Pi COS Ps — s(P2 Sin Ps + Pz COS Ps) 






+ m4(P2 COS Ps — Pz Sin p,), 






also 








(85) Gs = pi(—m Sin Ps + m2 COS Ps) + Dogs — PaQe . 






Setzt man zur Abkiirzung noch 





(86) qo = (Psqz — P2gs + q4)Pi » 
so ist nach (85) 







Go = —m Sin px + me COS Pa, 


und in Verbindung mit (83) folgt 






m = gi COS Ps — Qo SiN Pu, n2 = gi Sin p, + Go COS Pa, 





(87) 






ns = G2 COS Ps — Q3 SIN Pa, 4 = Go Sin Px + Qs COS Pa. 


Daher gilt nach (76) 






1 1 1 
(88) T= - (qo + qi) + on (q2 + 43) + _ {(qo + 4s)” + (1 + &)"}, 





ferner nach (77) und (82) 





(89) ri=pet+Dps, m=, 1 = (pi — pr) +73. 





Auf Grund des Transformationssatzes geht das System (79) durch die Substi- 
tutionen (82), (87) iiber in 






: dE dE 

Pk aan Vk on, 
wobei EH = T — U nach (88) und (89) als Funktion der p, , gq (& = 1, --- , 4) 
anzusehen ist. Mit Riicksicht auf (86) ist ersichtlich, dass E die Variable ~ 
nicht enth4lt, also 


(k = 1,-+-,4), 









dE 
—— <= @ 
Ops 







ist. Die zweite Gleichung (90) ergibt fiir k = 4, dass q, konstant ist. Dies ist 
gerade die Aussage des Flichenintegrals, denn aus (73), (74), (75), (82), (84) 
folgt 






3 
Ga = —mébe + nebi — nbs + me = »» m(Ie Ye — Yutx); 














ung 


D1) ’ 
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also ist gs die Konstante ys aus (5). Nach §5 ist daher fiir jede Dreierstossbahn 


(91) a= 0, 
und wir erhalten fiir p. , qe (k = 1, 2, 3) das System sechster Ordnung 
=) (2%) 
= —_ F = = — k = 1, 2, 3 ° 
te (= a0 ” Ope] aa—0 ( 


Ist dieses integriert, so ergibt sich p, durch Quadratur aus 


a (5) 
Ps 0q4 “=O 


9. DAs ASYMPTOTISCHE VERHALTEN DER px UND Qi 
Wir haben in §6 bewiesen, dass die Ausdriicke Pi = pt” (k = 1, 2, 3) fiir 
t— 0 Grenzwerte p, haben. Nach (59) ist im gleichseitigen Falle 


(92) pa ~ rt”, p2 ~ Ht”, Ds ~ “ rt, 


wobei r durch (60) gegeben ist, und nach (62) im geradlinigen Falle 


2/3 
’ 


(93) Pi ~ pt pe ~ wot”, ps = o(t””), 


wobei die Zahlen w, p in (66), (64) festgelegt sind. 
Wir wenden uns jetzt zur Untersuchung des asymptotischen Verhaltens von 
hm, d,s. Schreibt man zur Abkiirzung 


ree 


(94) COS Ps = yw, sin 4 = », 


te tthe ge 


so ist nach (74), (75), (87) 
(mt. = qi — Qo», Ma k2 = Go — Ys, 


M3i3 = —( + Jo) + (qo + qs), 





(95) 
mY = giv + gon, Meio = gov + su, 





MsYs = —(qi + q2)v — (Go + 4s)m. 
Nach (6), (73), (82) ist ferner 


( 


mx, = {(m2z + ms3)pi — Mepe}u + mepsr, 








my: = { (m2 + ms)pi — Map2}v — Mepsp, 
(96) 4 maz = {(m, + ms)p2 — mypijm — (m, + ms)psr, 
my2 = {(m, + ms)p2 — mipi}y + (mi + ms)psn, 
mas = —(mypi + mepo)u + mopar, mys = —(mrpi + mep2)v — Mapu. 





wee 
ak crit 
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Hieraus folgt mit Riicksicht auf (10) 

bd = pig + page + pags , 
also nach (25) 
(97) pign + page + pags ~ 9rt”, 


wobei A im gleichseitigen Falle durch (61), im geradlinigen Falle durch (68) 
festgelegt wird. Ausserdem ist nach (38) 


nth — yrti = o(t'"), ate — yatta = o(t'”), 
also 
(98) {(m2 + ms)p: — maps} qo + mapsqi = o(t’), 
(99) {(m, + ms)p2 — mipi}gs — (m + ms)page = o(t’”*). 
Zufolge (43), (83), (86) ist 
(100) ga = Ot”) (k = 0, ---,8). 


Im gleichseitigen Falle folgen aus (91), (92), (97), (98), (99), (100) die Bezie- 
hungen 


(101) 2a + @ + V3 ~ grr tt, 
(102) (mz + 2ms)(G2V/3 — gs) + 2mgrvV/3 = o(t””), 
(102) (ms = m1) Qs _ (m, + Ms) 2/3 = o(t**). 


Setzt man 


1 
qs = vB ma(m, + ms)q, 


so wird nach (103) 
= 3m2(m; — m)q + o(t *”) 
und nach (102) 
= 9m(m2 + 2ms)q + o(t ”), 
also nach (101) 
$(mymz3 + msm, + mM2)q ia gr? 
woraus nach (60), (61) 


r 1/3 
q~wit 
m 








(68) 


»Z1e- 
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und folglich 
n~ z (mz + 2m,)ré“”*, q~ z (ms — m)rt*”, 
(104) sea ios 
qs ™ av (m, + ms)rt 
sich ergibt. 
Im geradlinigen Falle ist nach (86), (91), (93), (98), (100) 
—{m2(1 — w) + ms}wgs = o(t ™”), 
also 
(105) qg=o(t”), gw =o(t”). 


Nach (38) ist ferner 
myma(art_ — roti + yrs — y2tn) = o(t””), 

also nach (95), (96), (105) 
(106) my {ma(1 — w) + ma}ge + ma{m (1 — w) — mw}qr = o(t'”). 
Setzt man diesmal 

qi = m{m(1 — w) + ms}q, 
so wird 

G2 = me{myo — m,(1 — w)}g + o(t'”) 
und nach (97) 
(107) {memge” + mgm, + mym2(1 — w)*}q ~ Fp 't 


Multipliziert man (64) mit m{m(1 — w) + ms} und (65) mit m{mw — 
m(1 — w)}, so folgt durch Addition 


—1/8 


§{memyo” + mam, + mym2(1 — w)"}p? = {memyw + mgm, + mym2(1 — w)*}m. 
Benutzt man noch (68), so geht (107) iiber in 
is 2p -13 
q 3 pa t ° 
Demnach ist jetzt 
2 ue 
n~ a {m2(1 — w) + ms}pt ”, 
(108) 


2 7 i 
— {mgw — mi(1 — w)} pt”, qs = o(t”*). 


=“ 3m 
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Durch (104) und (108) ist das asymptotische Verhalten von q:, q@, q3 im 
gleichseitigen und im geradlinigen Falle festgestellt. Dass auch p, einen Grenz- 
wert fiir ¢ — 0 hat, wird sich erst im spaiteren Verlauf der Untersuchung ergeben. 

Man kann (104) auch mit etwas geringerer Rechnung aus den Formeln (101), 
(102), (103) erhalten, indem man von §7 Gebrauch macht. Da namlich die 
auf den linken Seiten dieser Formeln stehenden linearen Formen von q , q , qs 
linear unabhingig sind, so sind die Grenzwerte von qt” (k = 1, 2, 3) firt 0 
eindeutig bestimmt. Also kann man sie durch Betrachtung des speziellen 
Falles von §7 ermitteln. Dort ist aber p; = 0 und folglich nach (95) 


QQ = mx, q2 = Moke, G3 = Me¥e. 
Aus (71) folgt dann (104). Analog kann man im geradlinigen Falle aus (72), 
(97), (105), (106) ohne weitere Rechnung auf (108) schliessen. 


10. Dire CHARAKTERISTISCHE GLEICHUNG 
Wir machen in den Differentialgleichungen (90) die Substitutionen 


* 2/3 * .—1/3 
Pe=pl, n= Kt (k = 1, 2,3), 
(109) | 


* 1/3 


* 
Ps = Pa, a=qat ’ 
(110) 


Nach (86), (88), (89) ist Z eine Funktion von Pe, Me (kK = 1, --- , 4), in welcher 
ps nicht auftritt. Ersetzt man darin die Variabeln p,, g, durch pg , ge , 80 
mége E* entstehen, und zwar ist 


E* = Ef”. 
Das System (90) geht dadurch iiber in 


oE* 
: ~“—_, = —3g + —; k = 1, 2,3), 
ds Pp. age ds 39k + ape ( 
dp; _ _0E* dai _,« , OE 
las sacs — = 3% + — |. 
ds 0q4 ds Op, 


dp, _,+ dE daz 
Pe Ae 


(111) 


Fir t — 0 ist s— ». Wie im vorigen Paragraphen gezeigt wurde, haben 
bei diesem Grenziibergang die Ausdriicke pp und ge (k = 1, 2, 3) bestimmte 
Grenzwerte p; und gq. Im gleichseitigen Falle ist nach (92) und (104) 


—r ie J Pe — ar, Ds — arvV/3, 


‘ m . 
an = — (mz + 2ms)r, a i (ms — mi), 


qs = a2 (m, + ms)r, 


m/3 














3 im 


enz- 
ben, 
01), 
die 
] qs 
+0 


len 


2), 
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im geradlinigen Falle nach (93) und (108) 


fi = Pp, Pz = wp, ps = 0, 
2 . 2 
(113)4qi = oe {m2(1 — w) + ms}p, qe = — {msw — m(1 — w)}p, 
3m 3m 
qs = 0. 





Nach §7 kennen wir eine spezielle Dreierstosslésung, niamlich 


KE = qt (k 7 4, 2, 3), Ps = Y, “a= 0, 


Pi sie prt” 
wobei also p, , G: (k = 1, 2, 3) im gleichseitigen Falle durch (112), im gerad- 
linigen Falle durch (113) aigulven sind. Wir erhalten also cine » spezielle Lésung 
des Systems (111), eine Gleichgewichtslésung, wenn wir Pe s oh konstant gleich 
Dis Me (k = 1, 2, 3) und py = 0, gs = Osetzen. Um simtliche Dreierstossbahnen 
zu seemiaiitia: haben wir saémtliche Lésungen von (111) zu untersuchen, fiir 
welche py , “ (k = 1, 2, 3) die Grenzwerte p, , @ haben, wenn s iiber alle 
Grenzen wachst. 
Wir setzen noch 


’ 


(114) De — Di: + bx , rh = dk + i438 (k = 1, 2, 3), ps _ 5s , G = 6;. 


Fiir geniigend kleine Werte der absoluten Betrage von 6; , 62, 53 lisst sich E* 


in eine Reihe nach Potenzen von 6, --- , 6; entwickeln, deren Koeffizienten 
noch von m,, m,, m3 abhangen. Die Differentialgleichungen (111) gehen 
dadurch tiber in ein System der Gestalt v 

dé my 
(115) = x, ere (kK = 1,--- , 8), 

ds i= 

; : 

WO a: (k, l = 1, --- ,8) Konstante bedeuten und ¢;, --- , gs Potenzreihen i = Hee 
5, --- , ds ohne A ee und lineare Glieder. Da E* nicht die Variable De i : 


enthalt, so gilt nach (111) 
(116) ag =~O(kK =1,---,8), ay =}, ax =O(lL #7), @ = 09. 


Es bedeute I die achtreihige Matrix aus den Elementen a,,, ferner % die 
Untermatrix von MM, die durch Streichung der beiden letzten Zeilen und Spalten 
aus Jt hervorgeht. Wird eine Einheitsmatrix mit € bezeichnet, so sind zufolge 
(116) die charakteristischen Polynome 


G(z) = |zE— MI, F(z) = | zé — A| 
der Matrizen MN und % durch die Gleichung 
(117) G(z) = 2(z — 4)F(z) 
verkniipft. 





are a 
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Fir jede Dreierstosslésung ist gs = 0, also 6; = 0. Es sei y, der Wert von g, 
fir 6; = 0. Wir haben dann das System 


6 
is = ps and: + vx 
ds i=1 

dds 
ds 


zu lésen, unter der Bedingung 6, — 0 (k = 1, --- , 6). firs — «©. Auf den 
rechten Seiten dieser 7 Differentialgleichungen tritt 6s nicht auf. Hat man 
also das System (118) vollstandig gelést, unter jener Bedingung, so ergibt sich 
5s; = ps aus (119) durch eine Quadratur. Zur naheren Diskussion von (118) 
ist die Kenntnis der charakteristischen Wurzeln der Matrix % notwendig. Die 
direkte Berechnung der Determinante |z€ — %| ist recht miihsam, da die 
Koeffizienten a;, sich nicht bequem bestimmen lassen. Einfacher erhalt man 
F(z) durch folgende Uberlegung: 

Bezeichnet man die rechten Seiten der Differentialgleichungen (115) mit 4, 
(k = 1, --- , 8), so hat man das System 

db, 


(118) 


6 
(119) = d g:51 + Ys 


und Yt ist die Funktionalmatrix (#) an der Stelle 6; = 0, --- ,53 = 0. Man 


l 
betrachte jetzt 6, , --- , ds als zweimal stetig differentiierbare Funktionen von 8 


neuen Variabeln 6, --- , 63, und zwar moége fir das Wertsystem 6, = 4 
(k = 1, --- , 8) speziell 6, = 0 (k = 1, --- , 8) sein und die Funktionaldeter- 
minante der 6, beziiglich der Variabeln 6 an der Stelle 6, = 6, nicht ver- 
schwinden. Durch diese Transformation gehen die Differentialgleichungen 
(120) iiber in 

d, _ +> Or 

ds g=1 06, . 
wobei die rechten Seiten als Funktionen der 6, anzusehen sind. Bezeichnet 
man diese rechten Seiten zur Abkiirzung mit ¥, (k = 1, --- , 8), so ist 


OW, _ >> me . (3) =. 80, Ib, d5, 


00 g=1 * 06; 36, ona 05, 5p 80; 


(E = 1, -:- , 8), 


(121) 


Es bedeute $ die Funktionalmatrix (*) an der Stelle 6; = 6,,--- , 03 = 6s 


l 


und @ die Funktionalmatrix (=) an derselben Stelle. Da der erste Summand 
l 


auf der rechten Seite von (121) an dieser Stelle verschwindet, so ergibt (121) die 
Beziehung | 


B= G'MG 
und folglich ist 
(122) |zE— $| =|zE-— G'MG| = | G"(2E — M)G| = | zE — M| = Gz). 





den 


1an 
ich 
18) 
Die 
die 


an 


$, 
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Um diese Formel zur Berechnung von G(z) anzuwenden, setzen wir noch 


(128) b= et’, om = me (k= 1,---,4), 
qo= got”, Go = (Pade — Pais) Pi’. 
Nach (82), (86), (87), (109) ist dann 
i= up, & =, t = up: — vp; , Ef = vps + ups , 


* x * * * * * * * * x 
m= uh — Go, 2 = 9 +490, 3 = M92 — 9s, m = G2 + Hugs, 


wobei », v in (94) erklart sind. Zufolge (114) sind die te , m Funktionen von 
é,, --- , 6s, die an der Stelle 6, = 0, --- , 5s = 0 die Werte 


1=pr, $££=-0, B=m, &=Ds, h=h, h=h, tB=h, t= Gs 
haben. Endlich sei 





f 6. = & (k =1,---,4) 
1 &\ oi 330 i 1 1\ «+ 1s 
(124) 48 = (-+32) ee ee (-+2) ng 
1 1\is' ‘1 ve 1 4 grup’ & 
=_lUc ——_ —_— —_— —o: a= — —_— —_— _— —— a 
67 (= + +) "3 or m1 8 (2 + “a n4 a Ne 


Durch eine einfache Rechnung erhalt man fir die Funktionaldeterminante der 
6, als Funktionen der 6, den Wert m?(mymzms)” * 0. Vermdge der Substitu- 
tionen (123), (124) gehen nun die Differentialgleichungen (78) tiber in 


d@ do 
(125) Ge = toe + Ons, —att = — Mes — Gh = 1,--.,@ 


mit 
G, = m263R;, + (m, + ms)0:Re + m2(O; — 03)Rs , 
G2 = meh, + (m, + ms)OeR2 + me(b2 — O4)Rs , 
= m0,R, + (me + ms3)O3R2 + 11(03 — 6:)Rs , 
Gs = m02R, + (me + ms3)ORe + mi(0, — 62)Rs , 
R, = (63 + 63)", R, = (62 + 6) **, Ry = {(0: — Os)° + (02 — HY”. 


S 
] 


Fiir die Werte der Ableitungen 


OG _ (k,U = 1, ---,4) 


an der Stelle 
(126) A=p, 6&=0, KX=fr, K=Ps 
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findet man nach (112) im gleichseitigen Falle 


V9 ne, 0 ine 


2 
3/3 3/3 — 


Om, m™ + Ms — FM, — —— 


ie — 2(mi + ms), 


‘ 37/3 
— im, ut mx, ¢(m + mz + ms), ai (m, — ms — ms), 





| oe m, 2m, _ (m, — m2 — ms), —$(mi + me + ms), 
3 


und nach (113) im geradlinigen Falle 

[ —2(my + ms) — 2maw *, 0, 2ma{w * — (1 — w)*}, 0, 

0, m + m3 + mw *, 0, m{(1 — w)* —w *}, 

—2m (1 — w *), 0, —2miw * — 2(m, + ms)(1 — w), 0, 
0, m(1 — wo *), 0, mw * + (ame + ma)(1 — w)™, 


3 
PCy = 





wo die Gréssen 7, w, p durch (60), (66), (64) fixiert sind. 

Bezeichnet man zur Abkiirzung die vierreihige Matrix (c;,;) mit ©, so hat die 
Funktionalmatrix der rechten Seiten von (125) als Funktionen von @,, --- , 9% 
an der Stelle (126) die Gestalt 


a 
—1¢}’ 


wo die rechts auftretenden Matrizen vierreihig sind, und nach (122) wird 


aE -E 
(+ ~€ 


Diese vierreihige Determinante bestimmt man durch direkte Rechnung unter 
Benutzung der angegebenen Werte der c,;. Es ergibt sich ein einfaches Re- 
sultat. Setzt man 


| 
pe TORS OE GE 


(2 + 3)(2 — %) =2, 
so wird im gleichseitigen Falle 
(127). Gz) = (x + 3)(@ — $)(@" — 3a — & + 4a) 
mit 


(128) q = M2ms + msm + mm 
(mi + m2 + ms)? 
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und im geradlinigen Falle 





(129) G(z) = (x + $)(z — $)(z + $ + $)(x — $ — $d) 
mit 

m{1+ (1 —w) + (1 — ow)” 1+ mall +wo'+w”’) 
on wa m + me{wo” + (1 — w)*} + m, 
Da 


2-9 = 2+ 
ist, so erhalten wir nach (117) das charakteristische Polynom F(z) der Matrix 
%, indem wir auf den rechten Seiten von (127) und (129) den Faktor x + § 


fortlassen. 7 
Die charakteristischen Wurzeln von % sind also zufolge (127) im gleichseitigen 


Falle die 6 Zahlen 


(—a=-% —m =${1—[13 + 12(1 — 3a)*}}, | 
(131) — ae = ${1 — [13 — 1201 — 3a)'}'}, as = 4{1 + [13 — 12(1 — 3a)'}'}, | 
| ae = {1 +[13 + 120. —3a)'}, = as = 1. 


Wegen der Beziehung 

2m7(1 — 3a) = (m2 — ms)” + (m3 — m)° + (m, — m2)” ; 
ist a S 3, und zwar a = 3 nur fiir m = me = mz; andererseits ist a > 0. 
Folglich sind die 6 Wurzeln saimtlich reeH, und zwar —a), —a, , —4@2 negativ, 


a3, 44, @s positiv. Ferner sind sie simtlich verschieden, ausser im Falle m, = 





mM, = M3, WO —d, = —a2 und az = ay wird. 
Im geradlinigen Falle ergeben sich aus (129) die Wurzeln 
figs | ee 20k bos: sea so 16b)'], be = {1 — (1— 86)4], 


[bs = {1 + (1 — 8b)}], by = 3{1 + (25+ 16b)'], bs =1. 


Von diesen sind —bp und —b, negativ, bs und bs positiv, be und 63 entweder 
positiv oder konjugiert komplex mit dem positiven Realteil 3. Die Wurzeln 
sind alle verschieden, ausser im Falle b = 3, wo bs = b3; = 4 wird. Die Gleichung 
b = $ liefert eine algebraische Bedingung fiir m, , mz, m3, die nicht identisch 
erfiillt ist. Wahlt man z.B. m, = m3, so ergibt (66) pes Wert w = 3, und aus 


der Annahme 6 = } folgt nach (130) die Bedingung ~ — = = 2. Also sind im 


allgemeinen die Wurzeln simtlich verschieden. 

Dass die Werte —2 und 1 unter den Wurzeln auftreten, hatte man auch 
ohne Rechnung aus dem Energieintegral entnehmen kénnen. Dagegen lasst 
sich die Bestimmung der tibrigen Wurzeln wohl kaum einfacher durchfiihren, 
als es hier geschehen ist. 
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11. ASYMPTOTISCHE BAHNEN 


Wir betrachten ein System von Differentialgleichungen der Form 


ta = Dd andr + 


ds i=1 


(133) 


Hierin seien die a,, reelle Konstante und die y Potenzreihen der Variabeln 
6,, --- , 6, mit reellen Koeffizienten, welche in einer gewissen Umgebung des 
Nullpunktes konvergieren und weder konstante noch lineare Glieder enthalten. 
Es seien \,, --- , A, die charakteristischen Wurzeln der Matrix (a2) = % und 
p; der reelle Teil von A, (k = 1,---,”). Von diesen reellen Teilen seien p 
negativ und n — p positiv, also keiner gleich 0. Wir denken uns die Wurzeln 
so angeordnet, dass 


(134) O>m2m2--- 2 pp, Poti = Ppxe 2 +: 2 Pn > O 
ist. 

Nach einem bekannten Satze der Elementarteilertheorie gibt es eine Matrix 
= (hz), sodass die Matrix 


(135) © AS = QD 


die Normalform besitzt. Zunachst werde angenommen, dass die Wurzeln ), 
(k = 1,---,m) simtlich verschieden sind. Dann ist 0 die Diagonalmatrix 
aus den Diagonalelementen \,;,---,An. Man kann noch voraussetzen, dass 
fiir je zwei konjugiert komplexe Wurzeln \; und , auch die j und die k“ Spalte 
von § zueinander konjugiert komplex sind. Macht man die Substitution 


(136) 5. = DD hia $i 


so sind 6, --- , 6, dann und nur dann reell, wenn fiir jedes Paar konjugiert 
komplexer Wurzeln \;, \; = A; auch stets ¢ = §&; gilt und fiir reelles \, auch 
¢, reell ist. Wir wollen weiterhin nur solche Werte der Variabeln ¢; betrachten, 
die diesen Bedingungen geniigen. 

Durch die lineare Substitution (136) geht das System (133) wegen (135) 
iiber in 

agi, 

137 wt 
(137) ,* Ne Se + Xe 
dabei sind die x, = xu(f1, --- , fn) Potenzreihen in ¢, --- , ¢, ohne konstante 
und lineare Glieder, welche den Gleichungen 


vy = > hexi 


geniigen. Fir A, = ; ist also auch x, = xX;. 





sk 
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Wir wollen das System (137) weiter vereinfachen durch Substitutionen der 


Gestalt 
(138) Ue = Se — Palo, --+ , Sp) (A= 1,---,n), 
wo die P; Potenzreihen der ersten p Variabeln £1, --- , {» ohne konstante und 


lineare Glieder bedeuten. Wir setzen diese Potenzreihen zunichst mit unbe- 
stimmten Koeffizienten an und wollen diese dann rekursiv eindeutig festlegen 
durch gewisse Bedingungen. Die Untersuchung der Konvergenz werden wir 
nachtraglich durchfiihren. Fiihrt man die Variabeln u, in die Differential- 
gleichungen (137) ein, so erhélt man das System 


(139) i Ae Ur + Qx(u, oS Un) (k ae ‘, oe n) 
mit 
Qe = xe tru Pi — p Sent *E , Ga + Arf), 


wobei in xz, Px, , xz, 1 die Variabeln u, --- ,u, durch Auflésung von 


(138) nach {1,---,¢n einzutragen sind. Die Q, sind dann offenbar Potenz- 


reihen in u, --- , u, Ohne konstante und lineare Glieder. Nun wollen wir die 
Koeffizienten der P; so zu bestimmen versuchen, dass in keiner der Reihen Q; 
Potenzprodukte der p Variabeln wu, --- , up allein auftreten; es soll also iden- 
tisch 
(140) | Qi(ui, ---,Up, 0,--- ,0) = 0 (kK = 1,---,n) 
gelten. 

Nach (138) sind {1, --- , {» Potenzreihen der p Variabeln 1, --- , Up allein, 
und fiir upy1 = 0, --- , Un = O ist ausserdem 

fe = Pilfi, --- 5 fp) (k=pt+1,---,m). 


Daher sind die Bedingungen (140) gleichbedeutend mit 
xe($1, eas » Sp, Pox, sing i Pal +r Pi 


(141) aP 
-> Be, (XGry -++ Soy Pon, «++» Pa) + ahi} = 0 
firk = 1,---,n, identisch in ¢,,---,¢>. Wir zerlegen 
= >> Pre (k =1,-+-,n), 
q=2 
wo Pi, ein homogenes Polynom q*” Grades in {1 , --- , {p bedeutet; entsprechend 


sei 2, der homogene Bestandteil g‘” Grades in {1, --- , ¢p auf der linken Seite 
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von (141). Dann gilt offenbar 









Pp 
(142) Rig = Pig — t® oP ee 


r Ha; 
A ae; $i + Hig 























wo H,, ein Polynom in {1 , --- ,$>,Pu(lL=pt+1,---,nj;h = 2,---,q-1) 
und a (l= 1,---,p;h = 2,---,q — 1) ist. Es seien bereits alle Py, fir 
1 

l= 1,---,nundh = 2,--- ,q — 1 bekannt; diese Annahme ist inhaltslos fiir 
q = 2. Dann haben wir P;, (k = 1,---,) zufolge (141), (142) aus den 
Bedingungen 

-. oP 
(143) —Ne Pro + DU Mist = Hin 

i=1_ OC7 










zu ermitteln, deren rechte Seiten bekannt sind. 
Ist c¢{' --- (4? ein Summand von H;,, , so ergibt sich fiir den Koeffizienten o 
des entsprechenden Gliedes von P;,, aus (143) die lineare Gleichung 


Pp 
| (-» + » ar)e = ¢, 


und danach erhalt man o, und zwar eindeutig, wenn 






re acy ee 









Pp 
(144) > gis ¥ x 


l=1 








ist. Dagi, --- , gp nicht-negative ganze Zahlen mit der Summe g = 2 und die 
reellen Teile von \1, --- , Ap negativ, die von Api, --- , An positiv sind, so ist 
(144) fir k > p stets erfillt und fir k = 1, .-- , p jedenfalls bei geniigend 
grossem g. Wir setzen zunichst voraus, dass (144) ausnahmslos erfillt ist, fiir 
k = 1,---,m und alle Systeme nicht-negativer ganzer Zahlen gi, --- , 9p, 
deren Summe grosser als 1 ist. Dann sind also die Potenzreihen P; auf genau 
eine Weise so bestimmbar, dass (140) erfiillt wird. 

Ks muss nun untersucht werden, wie es mit der Konvergenz der formal 
gefundenen Reihen P;, bestellt ist. Diese Untersuchung erfolgt am bequemsten 
mit der Cauchyschen Majorantenmethode. Sind P und Q zwei Potenzreihen 
in den gleichen Variabeln, so soll das Zeichen 


P<Q 










= ee es 
pt Natty Ne cle ni. 












bedeuten, dass fiir je zwei entsprechende Koeffizienten a und 8 von P und Q 
die Ungleichung 






je| <6 


gilt. Nach Voraussetzung sind die Potenzreihen y; in (133), also auch die 
Reihen x; in (137) konvergent in einer gewissen Umgebung des Nullpunktes. 
Bedeutet C; , wie auch weiterhin C; , --- , Cs eine geeignete positive Konstante, 






(1 


un 


Di 


ert 


Di 





iir 


lir 


t 


v 
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so ist also 


Chi t--s ton) - | 
1 — Cilfi +--+ + £n) = fh, -+ 5h) (k= 1,---,n), 


wo die rechte Seite durch ihre Potenzreihe zu ersetzen ist. 
Wegen (144) ist nun 





Xk < 





| Pp 
(145) b+ gits-s +9 <C2|—me + Dori], 
fir k = 1, --- ,n und jedes System nicht-negativer ganzer Zahlen gi , --- , gp, 


deren Summe mindestens gleich 2 ist. Wir ersetzen (141) durch die abge- 
inderten Gleichungen 
Pp oe Pp * 
oP. 
P; + > Pe og, = o(1 + 3 eV Fc —_ Tei Pies —s , P®) 

(146) i= Of1 i=i_ OF; 

(k = 1, eee, n) 

fiir n unbekannte Potenzreihen Py (f1, ---, fp). Diese lassen sich durch das- 


selbe rekursive Verfahren bestimmen wie die P;, und man entnimmt durch 
Vergleich von (141) und (146) wegen (145) unmittelbar die Beziehung 


(147) P. < Pi (k= 1,---,n). 
Aus (146) ersieht man, dass alle Py (k = 1, ---,m) identisch gleich sind; es 
sei P* der gemeinsame Wert. Setzt man noch ¢; = {2 = --- = fp = §, so 


mége P* in P(¢) tibergehen; ferner sei 


_ Cpe + (n — p)P}? 
IGP) = [Gr t+ @— DPI 





Offenbar ist dann 
(148) P¥(61,---, Sp) < Pi +--+ + op) 
und nach (146) 


OF 
dg 


Durch Einsetzen der Potenzreihe 


P(g) a rs d,¢* 


dP 
P+¢ Cs (1 + ws, P). 


erhalt man 
> (q + 1) d,¢* = 0,(1 + p> adat*™) I, P). 


Die hieraus entstehenden Rekursionsformeln fiir die Koeffizienten d, zeigen, 





EN ee pnts + 
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dass 
(149) P< ¢M, 


ist, wo M, der kubischen Gleichung 


- C3x(E + ¢M,)” 
fM, = i— Ge + tM) (1 + M;) 





geniigt. Nun ist 


(1 + Mi)” 1 
i=~GratD *i-cgem “<~™ 





und folglich 

(150) M, < M: 
mit 

Cs 


M: = om, 


Aus der Gleichung 


(1 — 20M)? = (1 — 40%) 


erhalt man endlich 
(151) M: < M 
fiir 
1+ 4C;M = (1 — 4C%g)", 
M = Csf(1 — 455)", 
also nach (147), (148), (149), (150), (151) 


CeolSi +--+ +f)” 
1 — Co(fi + --+ + Sp) 


Damit ist aie Konvergenz der Potenzreihen P, fiir hinreichend kleine Werte der 
absoluten Betrage von {,, --- , {» bewiesen. 

Schliesslich sind noch die Realititsverhaltnisse zu untersuchen. Aus der 
Lésung P;,---,P, von (141) entsteht eine weitere Lésung, indem man fir 
jede reelle Wurzel ,, die Grésse P, durch P, ersetzt und fiir je zwei konjugiert 
komplexe Wurzeln \; , \; das Paar P;, P, durch P;,, P;. Da aber die Lésung 
eindeutig festgelegt ist, so gilt P, = P, im ersten Falle und P, = P; im aweiten 
Falle. Nach (138) ist also u, reell fiir reelles , und uw = @; fiir \» = 4;- 

Wir wollen nunmehr saémtliche Lésungen der Differentialgleichungen (133) 
bestimmen, die fiir s > © in den Nullpunkt einmiinden. Nach (136) und (138) 
geniigt es, diese Untersuchung an dem System (139) vorzunehmen. Da s nicht 
explizit in den Differentialgleichungen auftritt, so kann man voraussetzen, dass 


P, < 








Rae =— 
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die gesuchten Lésungen fir alle s 2 0 der Ungleichung 


(152) »» lu? <e 


geniigen, wo ¢ eine hinreichend klein zu wahlende positive Konstante bedeutet. 


Setzt man 
(153) > |u|? = Z, 
k=p+1 
so gilt zufolge wr 
alse (Ax + Ap)ue te + = (wQu + te Qx). 


< k=ptl 


(154) 


Nach (140) ist nun jedes Glied der Potenzreihen Q, und Q, durch eine der 
Variabeln upy1, --+ , Un teilbar, also jedes Glied der Potenzreihe fiir den zweiten 
Summanden auf der rechten Seite von (154) durch ein Produkt zweier dieser 
Variabeln teilbar. Da diese Potenzreihe mit Gliedern dritter Ordnung beginnt, 
so ist nach (152) und (153) fiir geniigend kleines ¢« die Ungleichung 





(155) > (uxQi + %Qi) = —prZ 
k=p+1 

erfiillt, also nach (134), (153), (154) 

d. 

=f 22 > pr| Ux | — prZ = pnZ, 

k=p+1 
d(Ze °**) > 0, 
ds 


Daher ist der Ausdruckt Ze °™ fiir alle s 2 0 monoton wachsend; andererseits 


strebt er fiir s — © nach 0, weil p, positiv und Z < e ist. Folglich ist fiir die 
gesuchte Lésung Z = 0, also 


(156) tu = (k =p+1,---,n). 
Nach (140) reduzieren sich jetzt die Differentialgleichungen (139) auf die ein- 


fache Form 


(157) — = Aue (k = 1,---, p) 
ds 


mit der Lésung 
(158) UuU = a,e* (k = 1, A » P)- 


Dabei ist: a, reell fiir reelles 4, und a; = &; fir \, = 4;. Da die Realteile von 
Ma +++ , Ap negativ sind, so miindet die durch (156) und (158) gegebene Lésung 
bei beliebigen a, tatsichlich fir s > © in den Nullpunkt ein. 
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Durch die Bedingungen (156) wird eine analytische Flache von p Dimen- 
sionen im Raume der 6,, --- , 5» definiert. Man erhalt eine Darstellung der 
a Flaiche durch die Parameter {1, --- ,£», wenn man in die Ausdriicke 





bk = Dy hat (k = 1,---,n) 


l=1 








fiir fpi1, --- , fn die aus (138), (156) folgenden Werte 
fe = Pil&i, --- , Sp) (k=pt+1,---,n) 
ce eintraigt. Will man eine reelle Parameterdarstellung haben, so hat man das 





Paar ¢; , ¢% fiir Xx = A; durch (¢; + :)/2, (6; — £%)/2i zu ersetzen. Die gesuch- 
ten Bahnkurven erfiillen dann genau diese p-dimensionale Flache, und zwar 
erhalt man die einzelnen Lésungen, indem man vermdége der Gleichungen 


Un = Se — PlSi, +--+ 5 Fp) (KA = 1,---,p) 


die Gréssen £1, --- , {> als Potenzreihen von mu, --- , u» bestimmt und dann 













ARS 
Ur = aye 





setzt, mit beliebigen Konstanten a; (k = 1, --- , p) unter Beachtung der Reali- 
titsbedingungen. Die allgemeine Lésung hangt also von p reellen Para- 
metern ab. 

* Es seien noch einmal die beiden Voraussetzungen erwahnt, die wir in diesem 
# Paragraphen an verschiedenen Stellen der Untersuchung eingefiihrt haben, 
namlich die Verschiedenheit der Wurzeln 1, --- , An und das Bestehen der 
Ungleichung (144). Wir wollen noch feststellen, in welcher Art unsere Ergeb- 
nisse zu modifizieren sind, wenn wir diese Voraussetzungen fallen lassen. 





gle ai 
tert 













12. AUSARTUNGEN 


Wir verzichten nun auf die Annahme (144), halten aber zundchst noch an 
der Voraussetzung fest, dass \1,--- ,An verschieden sind. Wir denken uns 
dann die endlich vielen Lésungssysteme der diophantischen Gleichung 


Pp 
(159) > giki = %& 
=) 





bestimmt, wobei k irgend ein Index der Reihe 1, .-- , p ist und gi, --- 59» 
nicht-negative ganze Zahlen, deren Summe mindestens 2 betragt. In dem An- 
satz (138) waren bisher P; , --- , P, Potenzreihen in ¢,, --- , ¢p ohne konstante 
und lineare Glieder, deren Koeffizienten rekursiv eindeutig durch die Bedingung 
(140) festgelegt wurden. Jetzt schliessen wir aus dem mit unbestimmten 
Koeffizienten gebildeten Potenzreihen P, , --- , P, simtliche Glieder off! --- ¢? 
aus, deren Exponenten einer Gleichung (159) geniigen, und ersetzen (140) durch 
die folgende abgeschwachte Bedingung: Fiir k = 1, --- , p soll eine Identitat 


(160) Qi(ta, --- , Up, 0,--- ,0) = Vir, --- , up) 
gelten, wo V; ein Polynom aus solchen Gliedern cu{' -.- u%? bedeutet, deren 
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Exponenten (159) erfiillen; fir k = p + 1, --- , n soll wie bisher 

(161) Qi(ti, «++, Up, 0,---,0) = 0 

gelten. Es lasst sich auf dem friiheren Wege ohne Miihe zeigen, dass die Potenz- 
reihen P;, --- , Pn Wieder eindeutig bestimmt und konvergent sind, und auch 
die Polynome V;, --- , Vp sind eindeutig fixiert. Da zur Herleitung der Un- 
gleichung (155) die Formeln (140) nur fir k = p + 1,---,n herangezogen 
werden, so bleibt die zu (156) fiihrende Schlussweise bestehen. Bei sémtlichen 
fiir s > © nach dem Nullpunkt wandernden Lésungen von (133) ist also wieder 
Up = 0,---,Un = 0. An die Stelle des Systems (157) tritt aber jetzt nach 
(139) und (160) 


(162) > = rue + Vilwi, ---, Up) (tk = 1, ... , P). 


Nach (134) ist fiir jede Lésung von (159) 
g. = 0, Jrx1 = 0,--- , gp = O, 


folglich ist Vi = Vi(ur,--- , Wea) ein Polynom in uw, --- , uw. allein und 
V, = 0. Die Integration von (162) lasst sich ohne weiteres ausfiihren und 
ergibt 


dys 
Mm =aje , 


(163) uz = age’ + sVo(m) = {a2 + sVo(ar)}e™, 

allgemein durch vollstandige Induktion 

(164) ue = (ax + Wi)e™ (k =1,---,p), 
wo W;, ein eindeutig bestimmtes Polynom in a, --- , a,-; und sist. Da umge- 


kehrt bei beliebiger Wahl der Konstanten a, --- , a» die durch (164) gegebenen 
Funktionen u; fiir s > © gegen 0 streben, so gelten die Ergebnisse des vorange- 
henden Paragraphen auch in diesem Fall, wenn nur u, durch (164) erklart wird. 

Endlich lassen wir auch noch die Voraussetzung fallen, dass die Wurzeln 
Ai, --- ,An alle verschieden sind. Dann ist die Matrix Q = (qz:) in (135) im 
allgemeinen keine reine Diagonalmatrix: Es ist wieder qx: = Ax, Ger = O fiir 
kAlundk #14 1, gy = 0 firk = 1+ 1 und dr; ¥ Aus; fir k = 1 + 1 und 
Ai = Ay ist dagegen q,, entweder gleich 0 oder gleich \,. An die Stelle des 
Systems (137) tritt jetzt allgemeiner 

df; 


> 4 = Afi + x, 


Be = Nala + extn) + xe (k = 2,-+-,m); 
dabei ist e, = 0 fir y. ¥ \e-1, WHhrend im Falle 4. = Ax-1 entweder e, = 0 


oder e, = List. Anstatt (135) erhalt man durch die friiher benutzte Methode 
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die Differentialgleichungen 


(dus _ 
ds 
duy _ 

\ ds 

wo Q,, --- , Q, wieder den Bedingungen (160) und (161) geniigen, mit der dort 


angegebenen Bedeutung der Polynome Vi,---,V,. Anstelle des in (153) 
erklarten Ausdrucks Z betrachten wir allgemeiner 


n 
DD elu’ = 
k=p+1 


mit konstanten Werten hpii,---,/n. Zufolge (165) gilt dann 
dZ 
4 » = hi { (rw + Xe) | we |” ee (Aic eae + Ae Tes Un) } 


ds - gh 


(166) 


= ims + Qilu, «++ , Un), 


(165) 
Ae(ue + CeUn-za) + Qe(ui, +++, Un) (kK = 2,---,n), 


+ > (unQe + te Qi). 
k=p+l1 


Wegen Ap ¥ Apyi ist nun ep4, = O, und der erste Summand auf der rechten 
Seite von (166) lasst sich in der Form 


H = palha| Un|? + > (ost — 6, Pal me i) ual 


k=p+ 
2 


+ > prhy| Un + ox _ * aa 


k=p+1 


schreiben. Wahlt man speziell 


h - :, h = Pk Pk+t = eee 
p+ k+1 2D ke (k p +1, ,” 
so wird offenbar 


n 


(167) H=4 Dd pehe|ux|? = donZo. 


k=pt+1 
Aus (166) und (167) folgt jetzt, wenn ¢ in (152) geniigend klein gew4hlt wird, 
die Ungleichung 


dZo 

— = 49.7 

ds = 3Pn 0 

und daraus wieder das Verschwinden von up41, --- , Un bei simtlichen Lésungen 


von (165), die fiir s > © im Nullpunkt einmiinden. Es bleibt noch das System 


du 
= —_ Ait, 
dux, 


- Nu(Ue + Cetera) + Vil, --+, ua) (hk = 2,---,D) 





ort 
33) 








DER DREIERSTOSS 165 


zu integrieren. Die Lésung hat wieder die Gestalt 
U = (ax + W,)e** (k = 1, --- ,P), 


wo W, ein eindeutig bestimmtes Polynom in s und den Integrationskonstanten 
-- , @-1 bedeutet; insbesondere ist 


WwW, = 0, We = ${ €2dr201 + Ve2(a1)}. 


Damit haben wir die Resultate des vorigen Paragraphen auf den Fall mehr- 
facher Wurzeln A; , --- , An tibertragen, abgesehen von der Untersuchung der 
Realitatsverhiltnisse. Diese Untersuchung liesse sich ohne Schwierigkeit 
durchfiihren; wir gehen aber darauf nicht mehr ein, da bei der Anwendung auf 
die Dreierstosslésungen im Falle einer mehrfachen Wurzel alle Wurzeln reell 
sind und dann tiberhaupt keine imaginiren Gréssen in die Rechnung einge- 
fiihrt zu werden brauchen. 


a, 


13. Dirk DREIERSTOSSBAHNEN 


Wir wenden nunmehr die Ergebnisse der beiden vorangehenden Paragraphen 
auf die Untersuchung des Systemes (118) an. Die charakteristischen Wurzeln 
der Matrix 2 werden im gleichseitigen Faile durch die 6 Zahlen —a@, —a, 
—dz, 43, 44, as in (131) geliefert, im geradlinigen Falle durch die Zahlen — by , 
—b, , be, bs, bs, bs in (1382). Es sind —a), —a,, —ae, —bo , —b; negativ und 
die tibrigen Wurzeln sind positiv oder haben positiven Realteil. Folglich ist 
p = 3 im gleichseitigen Falle und p = 2 im geradlinigen Falle. 

Die Wurzeln —a), —a, , —4d2 sind saimtlich verschieden, ausser wenn m, = 
Mm, = ms ist, und dann ist —a; = —a, = (1 — V13)/6 > —a. Die Wurzeln 
—bo, —b; sind stets voneinander verschieden. 

Ks ist noch festzustellen, wann (159) lésbar ist und welches dann die Lésungen 


sind. Im gleichseitigen Falle ist 
a > a, = a, 2a, > &, a + a2 > &; 
gilt also 
artaytaz=a, «rtytz22 


fiir ein k der Reihe 1, 2, 3 in nicht-negativen ganzen Zahlen z, y, 2, so folgt 
x = 0, y = 0, und es ist entweder 


(168) ee, 209 
mit ganzem g oder 
(169) a; = haz, sak 


mit ganzem h. Die Annahme (168) ergibt nach (131) fiir die in (128) definierte 
Grosse 





_ m2ms + M3m + mm 
(mm + me + ms)? 
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a den Wert 
2 (170) a = v9 “(9 + 2)(8g° — 9 — 2) (9 = 2,3, ...), 
wihrend aus (169) 

(171) 









a = #h(h’ + 1) "(hv + 1)(h — v) @=12...) 





mit 





v = ph? — (A + 1) -1 + FL + 24? + D(A + 12) 


4, folgt. Hieraus ist noch leicht ersichtlich, dass nicht (168) und (169) zugleich 
eintreten kénnen. Beim geradlinigen Falle ist b; > bo ; gilt also 





box + biy = bk, r+y22 


fir ein k der Reihe 1, 2 in nicht-negativen ganzen Zahlen 2, y, so ist 












ri bi = jbo, r=j, y=0 
1 


mit ganzem j, und die in (130) erklarte Grésse 





_ m{l + (1 —o)* + (1 —o)"} +m(l +o! +o”) 
mi + mew? + (1 — w)-?} + ms 


hat nach (132) den Wert 
(172) b=j7 + 3(j — 3). 


Im gleichseitigen Falle bilden die fiir s — « in den Nullpunkt einmiindenden 
Lésungen von (118) eine dreidimensionale analytische Mannigfaltigkeit. Ist 
nicht (170) oder (171) erfiillt, so lassen sich 6,, --- , 5s in Potenzreihen der 
Variabeln 


b 














a,8 


Lord —ages = ate 
(173) Uy = aye a U2 = ae . Us = az = 






entwickeln; dabei hangen die Koeffizienten der Potenzreihen nur von mm, m2, 
m3 ab und a, a2, a3 sind willkiirliche reelle Konstanten. Ist (170) erfiillt, so 
hat man auf Grund von (163) 


(174) UY = (ay aor ciags)e “* 











zu setzen, WO C; eine von Mm, , M2 , m3 abhangige Konstante bedeutet. Im Falle 
(171) ist entsprechend 






(175) us = (ar — couse 7", 





Im geradlinigen Falle bilden die fiir s — » in den Nullpunkt einmiindenden 
Lésungen von (118) eine zweidimensionale analytische Mannigfaltigkeit. Ist 
nicht (172) erfiillt, so lassen sich 6; , --- , 5s in Potenzreihen der Variabeln 






—b — 
1 = Be, ve = Bre? 





entwickeln, wo die Koeffizienten nur von m, m2, m; abhangen und (1, / 



















— 


ih 7m rF7rr» 
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willkiirliche reelle Konstanten bedeuten. Ist (172) erfiillt, so hat man 
U2 = (62 — cafis)e*™ 


zu setzen. 
Tragt man die gefundenen Potenzreihen fiir 6, , - - - , ds in die rechte Seite von 


(119) ein, so erhalt man ds = pa durch eine einfache Quadratur. Da das un- 
bestimmte Integral einer Potenzreihe in den durch (173) oder (174), (175) 
definierten Gréssen u; , U2 , Us Wieder eine solche Potenzreihe ist und das gleiche 
fir die Potenzreihen in v , v2 gilt, so ist also 


a=rvtP, 


wo y eine willkiirliche Konstante und P eine eindeutig festgelegte Potenzreihe 
in UW, Ue, Us Oder in %), ve ohne konstantes Glied ist. Folglich hat p, einen 
Grenzwert fiir s — ©, néimlich die Zahl y. Damit ist also endlich bewiesen, 
dass die 3 Massenpunkte im ruhenden Koordinatensystem in bestimmten Rich- 
tungen zusammenstossen. Durch eine geeignete Drehung des Koordinaten- 
systems kann man erreichen, dass y = 0 ist. 

Tragt man die gefundenen Reihen nach (96), (109), (114) in die Werte der 
Koordinaten 2, y; (k = 1, 2, 3) ein und benutzt (110), so erhalt man im 
gleichseitigen Falle fiir jede Koordinate eine Darstellung 

v= Pw*(ur » Ue, Us), 
wo w*(u; , Ue , Us) eine Reihe nach positiven Potenzen von 


2/3 


uy = at ; Ww = at"*, u = ast”? 


bedeutet, deren Koeffizienten nur von m,, mz, ms abhaingen. Liegt eine der 
Ausartungen (170) oder (171) vor, so ist statt dessen 


us = (a1 + ca% log tt” 
oder 

Ue = (a2 + ca3 log t)t” 
au setzen. Im geradlinigen Falle hat man analog 

w = tw*(v; , v2) 

mit 

1 = pt”, v2 = pel” 
oder, wenn (172) erfiillt ist, mit 

v2 = (82 + cs6i log t)t”. 


Schliesslich kann man noch eine Drehung des Koordinatensystems um kon- 
stante Winkel vornehmen und ihm eine Translation mit konstanter Ge- 
schwindigkeit erteilen. 
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Die Werte a;, a2 und }; sind irrational, wenn m; , m2, ms nicht im Bereich 
der rationalen Zahlen in gewisser Weise algebraisch abhangig voneinander sind. 
Daher ist der Punkt t = 0 im allgemeinen ein logarithmischer Verzweigungs- 
punkt fiir die Koordinaten der Dreierstossbahnen. 

[Zusatz bei der Korrektur:] Inzwischen bin ich aufmerksam geworden auf eine 
Arbeit von G. Sokoloff,’ welche ebenfalls der Untersuchung des Dreierstosses 
gewidmet ist und Reihenentwicklungen fiir die Koordinaten der kollidierenden 
Massenpunkte enthalt. Im Beweis findet sich eine Liicke, die aber durch Be- 
nutzung des Bohlschen Satzes leicht ausgefiillt werden kann. Die vorliegende 
Darstellung diirfte in wesentlichen Punkten den Vorzug grésser Einfachheit 
haben. 


Princeton, N. J. 





6G. Sokoloff, Conditions d’une collision générale des trois corps qui s’attirent mutuelle- 
ment suivant la loi de Newton, Académie des Sciences de l’Ukraine, Mémoires de la classe 
des sciences physiques et mathématiques, Bd. 9 (1928), S. 1-64. 
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INTUITIVE PROBABILITIES AND SEQUENCES’ 
By B. O. Koopman 


(Received March 22, 1940) 


This paper forms a second part of the general study of probability regarded 
as a branch of intuitive logic, the first part having appeared in The Axioms and 
Algebra of Intuitive Probability.’ The point of view, notation, assumptions and 
theorems of the latter are assumed in the present work. The chief object here 
is to set forth the connection between the objective notion of statistical weight or 
frequency in a sequence and the intuitive conceptions of our theory. 

It is necessary first to examine the class of definitions of probability based on 
the notion of frequency in a sequence (collectives) ; we shall reach the conclusion 
that every application of such definitions of probability to experimental science 
implicitly presupposes the a priori intuitive conception of probability (§1). Further 
progress in the exact formulation of the ideas requires the scrutiny of certain 
logical discriminations involved in the relation of asserted and contemplated 
propositions, a discrimination the ignoring of which leads to fallacies (§2). The 
notion of symmetry and the related one of independence in a set of trials is then 
studied (§3); the former replaces in the present theory of sequences the notion of 
random in the usual theory of collectives. Finally the theorems relating statisti- 
cal weight with intuitive probability (numerical and otherwise) are established. 

There is thus achieved a complete synthesis of the intuitive concept of proba- 
bility with the objective notion of statistical weight which forms the basis of 
statistics and quantum mechanics,—and this without the assumption of a single 
general principle beyond those which have already been posited in our earlier 
work. 


1. Probability as a Frequency 


An event E (such as the tossing of a coin, the search for a photon with a 
sensitive plate, the observation of fatality following an inoculation, etc.) is 
regarded as having two possible outcomes, “success” (labeled 1) and “failure” 
(labeled 0). An infinite sequence of trials of E are made (conceptually) under 
essentially the same conditions, the results (1 or 0) forming the sequence (a): 
(@,a2,---) (a; =Oorl1). The theory of probability in collectives starts with 
the following assumptions.* 





‘ Presented to the American Mathematical Society April 26, 1940. 

* Annals of Math., Vol. 41, No. 2 (1940). Presented to the American Mathematical 
Society February 25, 1939. This paper will be referred to herein as AAP. 

*Cf., e.g., R. von Mises, Probability, Statistics and Truth (London 1939). A. H. Cope- 
land, Consistency of the Conditions Determining Kollektivs, Trans. Amer. Math. Soc., Vol. 42 
(1937) p. 333, as well as the references contained therein. 
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First assumption. limy+o(ai + +++ + an)/n = w. 

Second assumption. If (a’):(a; , az, ---) ts a sub-sequence of (a) defined by 
any place-selection s of the class S then limn+o(ai + --- +a,)/n = w. 

Here a place-selection is regarded as defined by a precise mathematical rule, 
and the class S as having been exactly delimited: we shall raise no question on 
this point.’ Any sequence (a) for which these assumptions are made is called a 
collective (properly: with respect to S), and according to what shall be termed 
herein the collectivist definition of probability, the number w (naturally 0 S w 
< 1) is defined as the probability of success of E (in the collective (a)). 

Our object is not with the mathematical theory of collectives. We wish 
merely to ascertain whether the collectivist definition of probability can be 
applied to an experimental phenomenon without an auxiliary notion of an a priori 
subjective order reducing in essence to that of intuitive probability. And since 
we are dealing only with Assumption 1, our critique will apply to the whole class of 
theories which regard probability as merely a frequency in a sequence. 

Let us suppose that I am given the information that (a) is a collective, that I 
know the definition of S, and also the value of w (e.g., let w = 0.1). Further- 
more, let me excise from my brain all a priori intuitive notions of “likelihood”, 
“degree of conviction’’, ‘‘reasonableness’’, etc., so that only my strict logical and 
observational powers remain. Just what conclusion can I derive from this 
knowledge concerning (a) and applying to the results of the totality of trials 
of E which I can make in my lifetime? Since I can only make a number N of 
trials obviously <10", and inasmuch as any set of values (a, --- , aw) (@n = 0 
or 1) will be mathematically consistent with the assumptions regarding (a), it 
follows that I can draw absolutely no conclusion at all (in the only terms which I 
can understand, i.e., strict logic), so my information regarding (a) is totally 
irrelevant to my experience with E. 

The situation is exactly the same for any individual whose name we can give. 
Even if it could be proved that some individual exists for whose set of trials of E 
surely (a; + --- + ay)/N = 0.1, we could never inform him that he was this 
favored one, nor could he ever know it himself—until, indeed, his trials of E 
are all completed, in which case the information concerning (a) would become 
unnecessary. And an individual can be replaced by all society during a given 
historical era: The number WN of total possible trials which it can make is much 
larger, but once the era has been named an upper limit of N is set and any values 
of (a1, --- , ay) are consistent with the assumptions concerning (a). It may 
be said that a time will come when all observations made thereunto will give for 
(a; + --- + ay)/N a value close tow. But this does not remove the difficulty: 
The time will never come when it can be proved that the proper value of N has 
been reached. 

A finite theory of collectives has been proposed in which the finite set 





* For a discussion from the point of view of modern logic ef. A. Church, On the Concept 
of a Random Sequence, Bull. Amer. Math. Soc. Vol. 46 (1940) p. 130. 
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(a, --- ,@m) replaces the infinite sequence (a;, a2, ---) and such ratios as 
(a, + --- + awn)/N replace the corresponding limits.” Here M is regarded as 
enormous—“practically infinite”—so that the previous N is minute in compari- 
son, and any values of (a, , --- , aw) are, here also, consistent with any values of 
the above ratio within limits of experimental error: The earlier difficulty is 
encountered here as before. This will naturally be quite otherwise if N is of the 
same order of magnitude as M; but in this case (even assuming that the experi- 
menter could ever know it) the theory will be superfluous. The matter can be 
put thus: The problem of sampling is insoluble—nay, unstable—in terms of 
strict logic divorced from all intuitive notions of probability. 

Certain philosophers regard the theory of collectives as affording not a theory 
of knowledge but a pattern of behavior.” But it is always a question not of 
any behavior but of a most favorable behavior in a certain highly precise sense. 
And the proposition that the particular pattern of behavior derived from the 
theory of collectives is actually this most favorable one becomes subject to all 
the aforementioned difficulties. 

Leaving collectives for the moment, we consider the great class of physical 
quantities which are defined as limits: It soon appears that here also the result 
of no measurement of the sort which can be carried out in the laboratory either 
determines or is determined by what is deemed the true value of such a quantity 
—by strict mathematical logic alone.’ For example the density p of a continuous 
medium at the point P is defined as p = lim M/V (as V — P): The finite set of 
ratios M/V which I can ever measure have values totally devoid of strict logical 
relationship with the value of p. Nor is the difficulty due to the idealization 
involved in the definition of p: If p is defined simply as the value of M/V 
measured by a specified person who has instruments a hundred times more 
effective than any which I possess, the difficulty remains the same (cf. the case of 
finite collectives). 

Of course, when I find that my values of M/V for rapidly decreasing V vary 
by amounts growing negligible, the common sense of the experimenter leads me 
to the conclusion that the final will not differ appreciably from the actual value 
of p. But precisely what does it mean to conclude something by common sense 
which is not derivable by strict logic? If we have made the meaning of intuitive 
probability in the least degree clear, it must be apparent that such a conclusion 
of common sense is precisely an instance of a conceptualism of intuitive proba- 
bility. Thus we are not entitled to it under the rigorous exclusion of all notions 
but those of strict logic: Our values of M/V and the value of p must remain 
without link or bind. 

Returning to collectives, let us consider how those who regard probability 





*R. von Mises, l.c., p. 121. 

*H. Reichenbach, Wahrscheinlichkeitslehre, Leiden 1935. 

“It is understood, of course, that certain quantities of the sort here considered may be 
derived from others by the laws of the physical theory. We merely claim that there are 
certain initial points at which the present difficulty arises. 
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as a frequency have sought to deal with the present question.” The limit w is 
regarded as of the category of (idealized) physical quantities defined by limits 
such as p; the observational material (a, + --- + aw)/N is the analogue of 
M/V. And it is regarded that conclusions applying to (a1 + --- + ay)/N 
can be drawn from w and vice versa in exactly the same sense and with the same 
validity as in the case of M/V and p: Why should a practical experimenter 
hesitate to do this more in the former case than in the latter? 

This reasoning of course is impeccable. But it must be clear by now that it is 
two-edged: either it compels us to concede that some rudimentary notion of 
intuitive probability is presupposed in the applications of the collectivist theory, 
or else it merely assimilates one insurmountable difficulty to another 

Having reached the point where it is compulsory to regard the applied notion 
of frequency as presupposing the intuitive concept of probability, the question 
is raised as to precisely how the relation between the two is to be formulated. 
Perhaps the most obvious way of doing this and at the same time making the 
fewest demands on the intuition would be to enunciate the following principle: 

The intuitive probability of success of E (in (a)) increases with w. 

Put in explicitly logical form and with the intuitive symbol (<) this becomes 

Hypotuesis. (i) The sequence (a) of trials of E is a collective of frequency w. 

(ii) The sequence (a*) of trials of a second event E* is a collective of frequency u*. 

(iii) w < w*. 

Conciusion. (E succeeds on n trial) < (E* succeeds on n* trial). 

This principle would enable us to derive non-trivial instances of (<) from the 
hypothesis (i) (ii) (iii) which do not contain any non-trivial statements involving 
(<) and thus place us in the presence of a situation utterly new in our theory, 
and quite contrary to our view expressed at the end of §1 of AAP to the effect 
that no such derivation is possible in the nature of things. 

It is easy to show, however, that the suggested principle must be modified. 

Case 1: suppose that in a particular instance of its application we are reliably 
informed of the outcome of some or all of the trials of Z and E%*, i.e., that we know 
the values of certain a, , a, . Then evidently the conclusion would be absurd 
if we knew e.g. that a, > a,—whatever the values of w and w*. 

Case 2: suppose that we are informed of the results of certain of the outcomes 
by a person in whose reliability we place considerable but not absolute trust. 
If he has declared that a, > a, this might lead us to deny the conclusion of the 
principle for the n’** trial—and on perfectly reasonable grounds. 

Case 3: suppose that we believe that certain more or less external events 
(weather, sun-spots, phases of the moon, incantations) are capable of influencing 
the outcome of HZ and E*, being favorable let us say to E, unfavorable to E*. 
Then we might well refrain from admitting the conclusion, particularly if the 
values of w and w* are exceedingly close. A second person may regard our idea 
of an influence on F and E* as reasonable, far-fetched, or absurd, depending on 





® R. von Mises, I.c., p. 124. 
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the case in point, but he will never be able to prove that we are wrong by any 
reasoning or objective experiment which does not itself start by making a priori 
assumptions involving (<)—certainly not by the use of reasoning based on the 
suggested principle concerning sequences if a glaring petitio is to be avoided. 

Evidently hypotheses (i) (ii) (iii) are insufficient to insure the conclusion: 
Something further must be added which is in the nature of an exclusion of the 
subsidiary knowledge (Case 1), reasonable certainty (Case 2), or disposition of 
belief (Case 3). Again, if we have made the conception of intuitive probability 
in the leastwise clear, the form which this additional hypothesis must take is 
evident: It must make assertions in terms of (<).. The precise phrasing of the 
condition will incorporate the idea that the various trials of E are made under 
essentially the same circumstances, as well as the notion that the trials are either 
independent events or have a constant influence upon one another. This 
whole question is studied in §3, the final condition being what is there termed the 
symmetry of the sequence of trials. The final addition to the hypothesis thus 
becomes: 

Hyporuesis. (iv) If (t:,---,%) and (ji, ---,js) are any two sets each 
containing t distinct positive integers, then the intuitive probability that the trials of E 
of orders 1, --- , t, should all lead to success is equal (=~) to that for trials j,, --- , 
jt. This is assumed for t = 1, 2,---. 

(v) Similarly for E*. 

The question now arises whether under all these hypotheses (i) (ii) (iii) (iv) (v) 
the conclusion may reasonably be maintained, and if so, whether this fact 
constitutes a new principle of probability which requires to be posited. The 
answer is trenchant: Absolutely no new principle need be assumed, and the 
conclusion in question is the conclusion of a theorem which has as its hypothesis 
the assumptions (iii) (iv) (v) and the first half only of assumptions (i) and (ii). 
And that part of (i) and (ii) which is discarded is precisely Assumption 2 in the 
definition of collectives,—that very condition, namely, which has been found 
most unwelcome inasmuch as it involves the somewhat arbitrary choice of the 
class S of place-selection rules. 

Before proceeding it is necessary to settle one question. Granting that 
lim (a + --- + a@,)/n = w, we may clearly rearrange the sequence (a) (at least 
when 0 < w < 1) so that any other preassigned limit is obtained; does this not 
lead to inconsistent results? This question is answered in the negative by 
scrutinizing the hypothesis concerning the existence of the limit w in the light of 
deeper logical considerations, whereupon the hypothesis in question appears 
capable of two utterly distinct interpretations; the one presupposed in our 
theorem is not the one which permits this rearrangement. These logical pre- 
liminaries occupy us in §2. 

One final remark regarding the precise formulation of this theorem in terms of 
the common presumption h: What right have we to assume in a given application 
that the concrete circumstances in the n’** trial are the same as in the m’*? 
It would clearly be more fitting to let the presumption in the first case be hn , 
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that in the second be h»,. The answer to this question is that in the case in 
point we assume that a certain h can be formulated for which a,/h, ~ a,/h 
(n = 1,2, ---): this is part of the assumption that the trials are all made “under 
essentially similar conditions.” While it is an assumption external to the theory 
of probability, being a statement of the observer’s attitude towards a physical 
situation, its precise formulation is made possible in the language of the theory, 
which thus fulfills exactly its appropriate function and nothing more. 

These remarks have bearing on Theorems 11 and 12 below. 










2. A Logical Distinction 


It has been explained (AAP §2) that the present theory makes use of two 
distinct classes of propositions. There are firstly the contemplated propositions, 
these being the concrete statements of the outcome of particular physical or 
biological events verifiable in principle by the performance of appropriate crucial 
experiments: They are denoted by lower case Latin letters and their finite 
(~ - V) combinations. There are secondly the asserted propositions, which can 
always be denoted by connecting a pair of symbols for contemplated propositions 
by means of a single one of the assertive symbols (C, <) or their derivates 
i (=, , <, ||, ete.). All the axioms and theorems of the present theory have a 
standard form: both hypothesis and conclusion are sets (finite or infinite) of 
asserted propositions; for in last analysis they are but statements of the laws of 
consistency governing any aggregate of logico-probability assertions which a 
given individual at a given moment can make.” 

It will be convenient henceforth to depart from one convention of notation 
made hitherto: We will apply the logical symbols (~ - V), as well as [] and 
a >., to asserted propositions, thereby obtaining new asserted propositions of the 
a same logical type. Thus for example 
“te 


(i) ~ (a/h < b/k); (ii) (@=1) V (b=1); 


















(ii) TI@=1), ie, (=m =1)--; 





are regarded as mere abbreviations for assertions which may be couched re- 
spectively in the following intuitive-logical forms: 
(i) The individual at the moment considered refuses to regard a/h <_ b/k. 
(ii) Either it is asserted that a is true, or else it is asserted that b is true, or 
else that both a and b are true. 
(iii) For each n = 1, 2, --- it is asserted that a, is true. 















* Two comments may be made in passing: Firstly, in view of Axiom I, (C) may be re- 
placed throughout by (<); for a C b which coincides with a ~ b C0 is coextensive with 
a~b/1 < 0/1. Secondly, in the assertion a/h < b/k, on the left h is in a sense asserted, 
on the right on the other hand, k is regarded as asserted. Such temporary or localized 
assertions are not of the kind which form hypotheses or conclusions in our axioms or theo- 
rems; that is why we have distinguished them by the appellation of presumptions. 
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It is to be emphasized that this notation is introduced purely for convenience 
as an abbreviation: without it all the forthcoming results would subsist. We 
shall assume the intuitive-logical rules for dealing with these symbols, and use 
parenthesis in a self-explanatory manner.’ 

We now come to the point where a certain logical distinction must be explicitly 
noted: A disjunction of asserted propositions is fundamentally different from the 
assertion of the corresponding disjunction of the contemplated propositions. Thus 
if a and b are contemplated propositions, the asserted proposition (a = 1) V 
(b = 1) is fundamentally different from the asserted proposition (a V 6) = 1. 
Similarly, the two following assertions are absolutely different. 


x (a, = 1); (o a,) = 1. 


Thus when (a = 1) V (6 = 1) occurs in the hypothesis of a theorem, we can 
always split the theorem up into two theorems, the first with a = 1 in the 
hypothesis, the second with b = 1: What is common to the conclusion of both 
theorems is then taken as the conclusion of the original theorem with the hypoth- 
esis (a = 1) V (b = 1). Similarly, if >>.(an = 1) is given by hypothesis, to 
prove the conclusion amounts to proving that it follows from each hypothesis 
a,=1(n=1,2,---). This is essentially different if the hypothesis is (a V 6) 
= 1 in the first case, (>).@,) = 1in the second. This point is so fundamental 
in the foundations of probability and has been so generally ignored that we shall 
consider in detail some of the paradoxes to which its disregurd may lead. 

Firstly there is the frequently heard objection to the possibility of any theory 
of probability, typified by the remark that proposition a is either true or false; 
in either case such an assertion as a/1 = ~a/1 is untrue; hence it is an impossible 
assertion. To be sure, we always assert that a is either true or false in the sense 
that we always assert (a V ~a) = 1; but in the above reasoning this is falla- 
ciously confused with (a = 1) V (~a = 1) and since the conclusion ~(a/1 ~ 
~a/1) is a consequence of a = 1 as well as of ~a = 1, its general validity is 
regarded as established. Instances of like kind lead us to the view that the 
distinction upon which we are dwelling is indeed a sine qua non in the theory of 
probability. 

Secondly, one might attempt the following proof of the Axiom of Alternative 
presumption (AAP §3), or better, of the following generalization: 

Hypotuesis. (i) a/hb, < r/s(n = 1, 2, ---). 


(ii) hC Do bn, ie, (h DS bn) = he : 
n=1 n=1 





‘© Clearly we could carry the transcription into logical notation one step furthet anil 
express all our axioms and theorems as implication statements between the (C, <)-asser- 
tions. Thus we would be in the presence of three distinct logical types of propositions: 
the contemplated propositions, the (C, <)-assertions, and the axioms and theorems of 
probability. But it is unnecessary and may be confusing to carry the symbolization so 
far in a work of this nature. 
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(iii) bsb; = O(7 ¥ J). 
Conciusion. a/h < r/s. 
- For (ii) can evidently be written 


(D> bbs) = h, 


n=1 


which, if the fundamental distinction were not observed would be confused with 


aay 35 ttm, 0: Ries > (h C br). 


This latter can, as we have already explained, be split into the sequence of 
hypotheses h C 6, (n = 1, 2,---). But in each case hb, = h and we have 
a/h = a/hb, < r/s, and thus the conclusion, being true in each case, follows in 
general. The invalidity of this proof due to the confusion of (iii) with (ii) is 
what has forced us to list Axiom P as an unproved assumption. As a matter of 
fact the above infinite extension of Axiom P appears to be false, as the following 
example would show. 

Consider two sequences of propositions (a; , a2, ---) and (a; , ag, ---) satis- 
fying the conditions 


(1) a= Das; € (a Va’) =h#¥0; 
aya; = aja; = 0 (t ¥ 9); 
a,a; = 0 (i,j = 1, 2, ---); 
(3) a;/h & a;/h & a;/h & a;/h. 
No a; or a; = 0 as this would lead by (3) and AAP Theorem 1 to this relation 
for all a;, a; , hence to h = 0 contrary to (1). Further, the evident relation 


a; V a; V a,/h a; V a; V a/h in combination with (3) gives, by AAP 
Theorem 2 (i.e., Axiom D sharpened) 


(2) 


, , , / , , , , 
a;/a; V a; V a, © a;/a; V a; V a, Y a,/a; V a; V a 


from which it appears (AAP Theorem 20) that a;/a; V a; V a; is appraisable and 
of numerical probability 1/3. Hence, if r/s is appraisable and of numerical 
probability 5/12, we have a;/a; V a; Vax<r/s. Onsetting b, = dn V dzn—1 V Aly, 
(n = 1, 2, ---) and noting the relations a/hb, = abn/bn = Gn/Qn@gn—102n We find 
that Hypothesis (i) of the theorem we are testing is verified; and (ii) is a con- 
gn of (1), (iii) of (2). Hence the conclusion a/h < r/s, hence p*(a/h) S$ 
5/12. 

On replacing a etc. by a’ etc. in the above reasoning, we obtain p*(a’/h) S 
5/12. Hence by AAP Theorem 18 we have p*(h/h) = p*(a V a’/h) S p*(a/h) 
+ p*(a’/h) = 5/6. And since p*(h/h) = 1, we have a contradiction. 

Returning to the sequence of trials of E, let a, denote the contemplated experi- 
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mental proposition “the n’* trial leads to success.”” We have now to cast into 
unambiguous logical form the definition of the statistical weight or frequency of 
success w. According to the First Assumption of Collectives: If r(n) is the 
number of the true propositions in the set (a,, --- , Qn), then limy+o 7(n)/n = w. 

First of all, how is this definition of r(m) to be understood? At the two ex- 
tremes there are the following interpretations, denoted for brevity by U(n, r) 


and V(n, r): 


U(n, r): >> (as, coe Me, ’ Gaag 2? “Gy, ™ 1), 


(s) 


V(n, r): (2 Get +++ Os, ~ Os,4, +++ ~Gs,) = 1. 
8) 


Here (s) = (81, --- , 8.) denotes any permutation of (1, --- , m) and the Dw 
calls for the formation of all the terms for all possible (s) and their combination 
by disjunction (V): that only n!/r!(n — r)! of the terms are distinct is im- 
material. But between these two extremes are a host of intermediate possi- 
bilities; thus in the case n = 3, r = 2 there are formulations such as 


[(ayaz ~~ ag V 103 ~ de) = 1] V [aeas ~ a = 1). 


It will be realized at once that V(n, r) is in a certain sense the weakest of all 
these expressions; it turns out to be the natural one for the theory of probability. 
But there is still a further modification in the direction of weakening which can 
be made, as we shall now see. 

The explicit wording of the definition of w would now appear to be: “For 
every positive integer u there exists a positive integer m such that assertion 
V(n, r) holds for all n = m, m + 1, m + 2, ---, and all r between nw — n/u 
and nw + n/p.’ This suggests the precise logical rendering: 


nutn/p 


I > dO) ven, r). 


p=1 m=1 n=m nw—n/p 


But this is capable of further weakening; for 


nwotn/p 


Ee: (r) (2 Gs, >* + As, ™ Os,44 °°° ~ As,) sa 1} 

nw—nip 
is only one possible interpretation of the statement: “the number of true proposi- 
tions in the set (a, --- , @,) is between nw — n/p and nw + n/p.”’ Another 


rendering is the assertion which we will call W(w, », n) suggested by modifying 
the preceding formula into 


nwo+n/p 
{>) (r) > as, eee Os, ™ Os,41 °°* ~ ds,} =], 


nu—n/p (8) 
Some evident algebraic manipulations allow it to be put into a simpler form. 
Let = t(w, u, n) denote the least integer > nw — n/u, and f = f(w, u, n) the 
least integer > n(1 — w) — n/p. Let 
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(p, q) = (pn, oes oe M,-** 54) 


denote any set of ¢ + f distinct integers chosen from (1, ---,m). Finally, 
employ >_(p, ) to denote the disjunction (V ) of all the terms formed by taking 
all possible choices of (p, g) (w, u, n being fixed) in the summand. Then the 
definitive formulation of the weakest interpretation of the assertion “the number 
of true propositions in (a, --- , dn) is between nw — n/p and nw + n/y” is as 
follows 


W(w, u, 2): (2 Ap, *** Ap, ™ Aq, *** ~ Aq) = 1. 
P.@d 


(Note that only n!/t!f!(n — t — f)! terms in the disjunction are distinct). In 
terms of this we have the assertion characterizing w: 


W(w): IT > II Ww, B, n). 
p=1 m=1 n=m 

We are now in a position to consider the following paradox: When 0 < w < 1 
there are infinitely many true propositions in (a; , a2, ---) and infinitely many 
false; let the former be denoted in their order of occurrence by (a,, , @;,, ---) 
and the latter by (a,, , @,,, ---). Then any number 6 (0 < @ S 1) being given, 
an explicit process is easily formulated which combines these two sequences into a 
single sequence—which is but a re-ordering of (a, a2, ---)—for which the 
frequency of the a,; is 6. How then can a frequency be taken as the numerical 
probability of success of E? 

The whole question hinges on the meaning of a,,. If the statement “a,, is 
the first true proposition in (a:, a2, ---)’’ be taken to mean “true in Nature” 
i.e., as experimentally verifiable but of unknown verity, then the frequency 
of true propositions has nothing to do with the definition W(w) of w. If it 
means “asserted as true,” i.e., = 1, then W(w) does not allow a,, to be defined. 
For W(w) allows us only to conclude that for some w, u, n for which t(w, u, n) > 
0, we have the assertion W(w, u, m); in one sense this tells us that at least one 
a; in the set (a; , --- , dn) is true, but in precisely the same sense as the assertion 
(a V ~a) = 1 tells us that at least one of (a, ~ a) is true: it leaves the verbiage 
“the first proposition in (a, ~ a) which is asserted as true’ without meaning. 
In last analysis it is but the first paradox which we have considered in more 
complicated form. 

If W(w) were replaced by 


Ul >I me (ap, se* Up, ~™ Aq +> ~ ay = 1) 


the definition of (a,, , a;,, ---) and (ag, , dy, , ---) would be possible. Thus if 
w= 1/2, = 4, and n = 4v = m, we have 


Dd (ap, eee Ap, ™ dq, eee ™ Ag; = 1). 
(p,@) 
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Now { = least integer = nw — n/p = 2v — vy = vso that ¢ = »; similarly, f = ». 
Thus we have 


(a, «++ Ay ~ G41 +++ ~ Ay = 1) V 7-2 V (dg, +--+ Gay ~ Ay +++ ~ Ay = 1). 


The disjoined terms are thought of as arranged lexicographically according to 
subscripts in the set of first factors such as a; --- a,, which are themselves 
supposed to be written in increasing order of subscript. Thus we may speak of 
the first term asserted as true, i.e., such that ap, --- Gp, ~ Gp,4, °°: ~ Ap, = 1, 
and then set 7; = 71 : a, is defined. Similarly for later terms. 

Thus this paradox like the rest is resolved by maintaining the logical distinc- 
tion which is the subject of this section. 

Before closing we shall consider an extension. Let h be the common pre- 
sumption made in envisaging the outcomes of E: We are considering the se- 
quence of eventualities a,/h. We wish to formulate an assertion W,(w) which 
is the extension of W(w) and characterizes w regarded as the frequency of 
successes of E on the presumption that his true. Such an assertion is evidently 
furnished by replacing (---) = 1 in the above by the assertion h C (---) (or 
equivalently h-(---) = hk). Everything else being as above, we set 


W,(w, u, n): hod Ap, +++ Ap, ~ Ag, +++ ~ Ay 
P,@) 


and correspondingly, 


Ww): TE 5 TT Walw, w, 7). 


p=1 m=1 n=m 


Finally, if Dco<w<1 is used for the disjunction of all the terms formed by 
letting w range from 0 to 1, the assertion: “there is a frequency of successes on 
the presumption h’’ shall be formulated as 

W;: > W,(w). 


0swsl 


3. Symmetry and Independence 


DEFINITION 1. Any finite or countable set of propositions (a; , a2, ---) shall 
be said to be a symmetric set with respect to the proposition h(+0) when for every 
possible positive integral t 


(1) ai, -++ Gi,/h & aj, -++ ;,/h 
holds for every choice of the t distinct integers (1; , --- , i) and the t distinct integers 
(ji, 7 » Jt). 

Here h ¥ 0 denotes the assertion ~ (h = 0). 

Let o(1,--- ,2%m) denote a finite combination of the symbols 2, --- , 2m 


with the aid of the logical constants (~ - V), the convention being that the 
formal laws of Boolean algebra hold for these symbols; such a (21, --+ , Lm) 
may be called a Boolean polynomial in these m (propositional) variables. If by 
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application of the formal Boolean identities alone, i.e., without assuming any 
individual properties of z:, --- , Zm, it may be reduced to 0, 1, or some other 
given proposition not involving the variables, we will say that it is identically 
equal (=) to this proposition. 

TuroreM 1. If (a, a2, ---) is a symmetric set with respect to h, then for any 
g(t1,--+ , 2m) (m not exceeding the number of propositions in the set) the relation 


(2) ea, ar ai,,)/h ~ g(a;, eT ae An) /h 


holds for every possible pair (i: , --- , tm) and (ji, --- ,Jjm) of m distinct positive 
integers each. 

Elementary algebraic manipulations show that any ¢(2, --- ,2m) can be 
expressed as the disjunction of disjoint monomials 2;, - - - Te, ~ kya. ~ Thea, 
thus when (2) has been established for monomial ¢(2,, --- ,Z%m) its general 
validity becomes an immediate consequence of the theorem of total probability 
(AAP Theorem 6, sharpened). The relation 


oh? ay oo Gag 8 ;,4,/h © aj, 109 Oey Geis SR Qi,4,/h 


is proved by induction for all s. Assume it true for all values of the first sub- 
script ¢ and all values of the second between 0 and s inclusive and apply AAP 
Theorem 7 (sharpened) to 


CG + Se Ra, °° Di, 4 Bins 4,/h PS Aj, +++ Aj, MAjyy ee ~ igs igs s/h, 
(i, ++ Gi, ~ ings ss ~ Gig, Gi, +++ Dig ~ Digg y 20s ™ Digg Digeetis h 


respectively replacing a; , b; , hi , etc.). The required relation is an immediate 
consequence. 

Corotuary. If (a; , a2, ---) is symmetric with respect to h, then any subset of 
(a; , a2, ---) and any subset of (~a, , ~a2, -- -) is likewise. 

DerFIniTion 2. Any finite or countable set of propositions (a,, a, -:-) 
shall be said to be an independent set with respect to the proposition h( +0) if firstly 
ha; ¥ 0 or h, and secondly 


(3) a;/ha;, --- ai, ~ai/h 


for every possible t and every choice of t + 1 distinct integers (i, 1, --+ , %1). 

We must prove that (3) has meaning in every case, i.e., that ha;, --- ai, ¥ 9. 
This is assumed in the hypothesis when ¢ = 1. Proceeding inductively, 
let it be assumed for the value t; we then will show that ha;, --- di,,, # 0. 
If ha;, --- a;,,, = 0 the following relations would occur: 


Qi,4,/h & aj,,,/hai, + A, = ha, Pio) ai,,,/hai, +++ Qi, = 0/ha:;, +++ Gi, 


and hence ha;,,, = 0 (AAP Theorem 1) contrary to hypothesis. 
A precisely similar process shows that ha;, --- a;, # h, and furthermore that 
if (t1, --- , eye) are any ¢ + s distinct positive integers 


hai, » alts Qi, — Qis+) io Se. ree Diss A 0 or h. 
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From this relation Theorem 2 follows immediately: 
TuroreM 2. If (a, a2, ---) 78 an independent set with respect to h and if 
- , tm) is any Boolean polynomial not identically 0 or h, then for every m 


g(r mee . 
distinct (i1,--+ , tm), he(ai,, +++ ,@i,) # O or h. 

TueorEM 3. If (a;, G2, ---) ts an independent set with respect to h and if 
y(ti, --- ym) (#0, h) ts as above, then for any m + 1 distinct (i, i,, --- , im) 
(4) a;/he(ai, oT HS ai,,) ~ a;/h. 

As in the proof of Theorem 1, g(x, --- , 2m) is expressed as the disjunction 
of disjoint monomials z,, --- Xr, ™ Tho, +++ ~ Tez4,- Once the relation 
(5) a/ha;, 209 Ag, ™ Viggo 0 ™ Migs, ~wa;/h 


has been established the truth of (4) will follow by an immediate inductive 
application of Axiom P.” 

We assume the truth of (5) for all values of the first index ¢ and all values of 
the second between 0 and s inclusive, and seek to prove it when the second index 
is replaced by s + 1. On account of the assumption we have 


Q;/haj, +++ Dig ~ Diggs 6 ~ Ding Bingen, © Ai/h 
~ a;/ha;, cos Bi ~ Gig, + ~ Oi, 
The desired relation follows on applying AAP Theorem 10 (sharpened) with a; , 
Dineen,» AQi, +> Gi, ~ Ging, +++ Gi,4, Teplacing the a, k, h, respectively, of that 
theorem. 
Corotuary. If (a, , a2, ---) 18 an independent set with respect to h, then any 
subset of (a1 , d2, ---) and any subset of (~a, , ~a2, ---) is likewise. 


The notion of symmetry corresponds with that of events all equally probable 

and such that any combination of results of one set is as probable of occurrence 
as the same combination of another set. The notion of independence on the 
other hand corresponds with the idea that knowledge of the occurrence of some 
of the events does not change the probability ascribed to that of the others. 
A set can be symmetric without being independent, for symmetry does not 
exclude hg(a;, --- , @m) = 0, h: sets in which a, = a2 = --- orin whicha V - 
V Gm = 0 may be symmetric but can not be independent. On the other hand 
sets may be independent without being symmetric; for example a, a2, etc. 
may denote success of respective trials of unconnected events but events of 
different kind and thus in general, different probability. Hence the significance 
of the following 





"The theorem employed here is the following: 

If hee; = 0 for all i ¥ j and if a/he; < r/s (i = 1, ---, n), then a/he < r/s where c = 
C1 Vi«es Ven. 

This is proved inductively from the case n = 2, in which it is a self-evident consequence 
of Axiom P(e; and h(c; V cz) replacing b andh). The sharpening and strengthenings are 
mediate. This theorem belongs properly in AAP §4, just before Theorem 13. 
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TuroreM 4. If (a,, a2, ---) is an independent set with respect to h and if 
a;/h =~ a;/h for all i and j, then (a, , a2, ---) 18 @ symmetric set with respect to h, 
For we have by hypothesis 


a;,/hai,  ai,/h & ai,/h  a;,/h & a;,/h & a;,/haj, . 


On applying Axiom C, (sharpened by AAP Theorem 2) with h, ai, , ai, , h, aj,, 
a;, in lieu of hy, a1, bi, he, a2, be, the result a;,a;,/h © a;,a;,/h is obtained; the 
proof of the general relation (1) proceeds by induction. 

In the classical theory the only relations of probability are those concerned 
with numerical probability. The notions of symmetry and independence are 
less restictive insofar as equiprobability (a/h ~ b/k) implies equal numerical 
probability (p(a/h) = p(b/k)) and not conversely; but they are more restrictive 
in that the appraisability of every eventuality is assumed. We give the follow- 
ing definitions and theorems (in essence familiar) mainly for completeness. The 
proofs are supplied immediately by means of the laws of total and compound 
probability (cf. AAP, §5, Theorems 23 to 28). 

Let (a, a2, ---) be any finite or countable set of propositions, let h ¥ 0 
be a further proposition, and assume that all the eventualities with respect toh 
(i.e., remainder classes with respect to the principle ideal (~h) in the Boolean 
ring determined by (1 , a1 , d2 , -- -)) are appraisable. 

DerFINITION 3. The set (a,, d2,---) shall be called numerically symmetric 
with respect to h when for all indices as in Definition 1, 


(5) pai, --- ai,/h) = p(a;, --- a;,/h). 


THreoreM 5. If (a1, a2, ---) is a numerically symmetric set with respect to h, 
then for any Boolean polynomial g(x; , --- , Lm) 


P(P(Gi, , +++ , Gin)/h) = p(o(a;, , --+ , Fim) /h). 


CoroLuary. The numerical symmetry with respect to h of (ai , a2, ---) implies 
that of any sub-set of (a; , a2, ---) and any sub-set of (~a, , ~a2, ---). 

DeFIniTION 4. The set (ai, a2, ---) shall be said to be numerically inde- 
pendent with respect to h if firstly0 < p(ai/h) < 1 for alli, and secondly (the indices 
being as in Definition 2), 


(6) p(ai/ha;, --- a:,) = p(ai/h). 
THEOREM 6. [If (a;, a2, ---) is numerically independent with respect to h and 
g(%1,---,2%m) any Boolean polynomial (# 0, h) then for every m distinct (ti, 
ae Um) 
0< P(y(ai, , Pree » Vin) /h) <1. 


THEOREM 7. If (a,, a2, ---) and o(%,---,2%m) are as in Theorem 6, then 
for any m + 1 distinct (1, i, --+ , 4m) 


p(ai/he(ai, aleil a,,)) = p(a;/h). 
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CoroLiaRY 1. Under these conditions p(a;, --- ai,,/h) = p(ai,/h)p(a;,/h) --- 
p(ai,,/h). 
CoroLLARY 2. The numerical independence with respect to h of (a, , a2, ---) 


implies that of any sub-set of (a, , 2, ---) and of any sub-set of (~a, , ~a2, -- -). 
CorotuaryY 3. If (a: , a2, ---) 78 a numerically independent set with respect to 


hand if for alli and j, p(ai/h) = p(a;/h), then tt is numerically symmetric. 


4. Theorems on Sequences 


TuroreM 8. If (a,,--- ,@n) 18 symmetric with respect to h and if precisely 
r(0 < r S$ n) of its elements are implied by h in the sense that (§2) 
Vin r): ho } e Gs, +++ Os, ™ As,4, °° ~ As; 
(8) 


then a;/h is appraisable and p(a;/h) = r/n (¢ = 1, --- , n). 
There is no loss of generality in assuming that a; C h, i.e., ha; = a; (¢ = 1, 
. , n); for this property can, always be secured by replacing (a, --- , a.) by 
(ha; , --+ , Aan) which evidently also satisfies the hypothesis; since ha;/h = a;/h 
the conclusions are the same. Proceeding on this assumption and excluding the 
trivial case of our theorem in which r = 0, we have on multiplying hypothesis 
V,(n, r) through by h 


h= 2, as, 10+ Os, ™ Ae,4, *¢* ™ As,- 
(s 
The summation pr" is for all permutations (s) : (81, --- , Sn) of (1, ---,m), 
as above. 

There are evidently v = n!/r!(n — r)! distinct terms in this equation (i.e., 
terms not reducible to one another by the identities of Boolean algebra) ; let them 
be denoted in some conventional order by (uw, --- , u,). Of these terms there 
will be clearly 7 = (n — 1)!/(r — 1)! (n — r)! distinct ones involving a pre- 
assigned a; as factor; let them be (u;, , --- , ui,). We have moreover u,u = 0 
for all k ¥ 1, and of course uw V --- V uy =h#O0. Finally, as a result of the 
symmetry (cf. Theorem 1) uz./h  w/h for all k and 1. Thus (m, --- , us) 
forms a v-scale (AAP, §5, Definition 1) and if we set b = ui, V --- V wi, we 
have by AAP Theorem 20 that b/h is appraisable and p(b/h) = 1r/v = r/n. 
Thus our theorem will be proved once it is established that b/h = a;/h, i.e., 
that hb = ha;. This is shown on multiplying the equation 


wm V--»-Vm=bV uj, V--- V uy, =A 


through by a; : clearly if u;, does not have a; as a factor, it does have ~a;; 
so that awu;, = 0, ete. 


Corotuary. If (a,,--- , an), hand iy, de, a.,), h' each satisfy the hypothe- 
sis of Theorem 8 for the same value of r, then a:/h ~ a; /h’. 
THeoreM 9. If (a;, --- ,@n) is symmetric with respect to h and tf at least r 


(0 S$ r S 1) of tts elements are implied by h in the sense that 


Xiln,r):hC Dy day ++ Gar 
(81,°°*» Sr) 











Fae Bias 
4 ti, 
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(where the summation takes in all possible sets of r distinct integers in (1, --- , n)), 
then for each value of i, p,(ai/h) 2 r/n. 

Again we assume:a; C h (i = 1, --- , n) and disregard the trivial case r = 0, 
Multiplication by h puts X;,(n, r) into the form h = bs Ms, +++ Qs,. We set 


(1) Op = Dy ny +++ Day ~ apy 22° ay; 
a (8) 


here as in V;,(n, r) the summation is. for all permutations (s): (81, --- , 8,) of 
(1,---,m). Simple algebraic manipulations establish 


(2) h= >; CoCo = 0 (p ¥ 0); he, = Cp. 
p=r 
Let p be a value for which c, ¥ 0: since h ¥ 0, such values will occur. We 
will show that (a; , --- , @n) is symmetric with respect to c,. In order to prove 


(3) Gi, -*- di, /Cp Yay: a;,/Cp 
we first introduce the Boolean polynomial 


ae eee = 7 eee tH Da ty eee Ls, ™ Ls,414 eee ™ Xs, 
& 


and apply Theorem 1, whereupon the equation 

O(Liy 0+ Vig y Liga 0+  Lig)/R AS Oi y +++ Lig Ling 2+» Liq)/h 
is obtained (all subscripts in each set being distinct). But this may be written as 
(4) Cp0i, --+ Ai,/h XS C,4;, --- a;,/h. 


Now if either c,a;, --- ai, = 0 or c,a;, --- a;, = 0, (4) shows that both these 
equations hold, and (3) becomes evident. In all other cases we apply Axiom D 
(sharpened) to (4) together with c,/h ~ c,/h (with a, = a2 = c,, hy = he = h, 
bi = aj, --+ ;,, be = aj, --+ a;,) whereupon (3) is established. 

This symmetry and the relation (1) show that (a, --- ,@,) satisfies the 
hypothesis of Theorem 8 with (p, c,) replacing (r, h); it follows that a;/c, is 
appraisable and p(a;/c,) = p/n (¢ = 1,---,n). 

Let o be any given positive integer, and (uw, --- , Une) an no-scale. Set 
v= Wm V-es V Uni, U = UW V --+ V Une; then p(v/u) = (ro — 1)/no = 
r/n — 1/ne (AAP Theorem 20). Hence for each p for which c, # 0 we have 
a;/c, > v/u (AAP Theorem 16). 

Returning to (2), we may write h = b dy c, where the summation p hag includes 
only those terms of (¢c,,---,¢n) which * 0. By applying Axiom P” to 
a:/c, > v/u we derive the result that a;/>,’ c, = a;/h > v/u. Hence by defini- 
tion of p,(ai/h) we have for all values of the positive integer o that p, (ai/h) 2 
r/n — 1/no, which leads at once to the conclusion of our theorem. 

THEOREM 10. If (a, --- , an) is symmetric with respect to h and if h implies 
the truth of at least t and falsehood of at least f of its elements in the sense that 
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Xi(n, t, f): hc p> Ap, +** Ap, ™ Aq, +++ ™ Ag 
P,@ 


(where the summation takes in all possible sets of t + f distinct integers from 
(1 we ,n)), then 


’ 


t 
(5) ~ S$ p,(a/h) S p*(ai/h) <1 4 
The first inequality in (5) results from the application of Theorem 9 (r = 2) 


inasmuch as it is clear that 


hc > Gp, +++ Ap, ™ Aq, -*> ~ dey SD as, ++ As, 
(p,@) s) 


To obtain the last inequality in (5) we have but to replace (a, --- ,a,) by 
(~a,,--+, ~@n) (ef. Theorem 1, Corollary) and apply AAP Theorem 17 to 


the resulting inequality 
f/n & p,(~ai/h) = 1 — p*(a;/h). 


THEOREM 11. Let the infinite sequence (a; , Q2, --- ) be a symmetric set with 
respect to h, and let the limiting frequency (statistical weight) of its elements implied 
by h be w in the sense of the assertion 


Ww): Ul > Ul Wilu, BM, n), 


p=1 m=1 n=m 
in which W,(w, u, n) coincides with the assertion X;(n, t, f) in Theorem 10 in 
which t = t(w, uw, n) is the least integer => nw — n/p and f = f(w, yu, n) ts the 
least integer = n(1 — w) — n/p. 

Then a;/h is appraisable and p(a:/h) = w (i = 1, 2, --- ). 

Let us choose a value of » and hold it fast. Then in virtue of the hypothesis 
W,,(w) there exists an integer m for which | ail W,(w, u,n) is valid. For definite- 
ness we select the least m for which this is true (we may as well represent it 
by m); then we have for this m and, e.g., for n = m the assertion W,(w, u, n). 
Hence by Theorem 10 we conclude that 


t 
nn) S p,(ai/h) <= p*(a;/h) = 1 — iene: 
This yields on referring to the definitions of i(w, », n) and f(w, u, ) 


w— : S p,(a:/h) S p*(a;/h) S w+ .. 


This equation, being true for every u = 1, 2, --- , leads at once to the conclu- 
sion of our theorem. 

Corottary. Let (a,, a2, ---) be symmetric with respect to h and let the 
limiting frequency of its elements implied by h exist and lie between w; and we in 
the sense of the assertion > a <wsw, Wx(w). Then a;/h is appraisable and w; S 
P(ai/h) S we (¢ = 1, 2, --- ). 
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THEOREM 12. Let (a: , @2, --- ) be symmetric with respect toh and (by , be, -.. ) 
be symmetric with respect tok. Let the number of a’s implied by h eventually become 
and remain greater than the number of b’s implied by k in the sense of the assertion 


«o x n— 


1 
2 STE ec Day a GOD ye yd 
Then for every i, j: ai/h > b;/k. 

As usual (s,,--- , 841) is a set of r + 1 distinct integers chosen from 
(1, ---, 7), all possible sets of this kind being taken in the >>; similarly for 
(t;, -+- ,¢tn—r) and the corresponding )>. 

Let m denote an integer for which []%_, --- is true by assertion (for definite- 
ness, the least m). Let n denote the greatest of (m, 7,7). With this determina- 
tion of n we have the assertion }>’=> { }. Let r denote a value for which 
assertion {| } is made. Then we have simultaneously 


he a ke > ~bi, «+» ~di,-,- 


We are now in a position to apply Theorem 9 to each of the sets (a1, - ++ , dn) 
and (~b,,---, ~b,), the latter of which is symmetric by the corollary to 
Theorem 1, and so obtain 


oe F 


py(ai/h =" 2, py(~b/k) & 


But p,(~b;/k) = 1 — p*(b;/k), and hence p,(a;/h) > p*(b;/k), from which 
the conclusion of our theorem follows (AAP Theorems 17 and 16). 

In concluding this section we may consider the question of replacing in the 
hypotheses of these theorems the requirement of symmetry by that of nu- 
merical symmetry. At first sight such a paraphrase would have the advantage 
of weakening the condition of symmetry; but this advantage is in our opinion 
far offset by the disadvantage that appraisability has to be assumed in the 
hypothesis—and it is indeed one of the chief functions of the theorems of this 
section that they afford the property of appraisability in their conclusions. 
Furthermore, symmetry rather than numerical symmetry appears to be the 
natural assumption to make in the applications. Nevertheless the quanti- 
tative conclusions subsist when numerical symmetry is assumed instead of 
symmetry. We confine ourselves to a statement of the following examples. 
The proofs, which proceed along rather similar lines to those of the earlier 
theorems and with the use of the classical properties of numerical probability, 
are omitted. 

THEOREM 13. If (a, +--+ , dn) ts numerically symmetric with respect to h and 
if assertion X,(n, t, f) is made, then t/n S p(ai/h) S$ 1 — f/n. 

THEOREM 14. If (a,, a2, ---) is numerically symmetric with respect to h 
and tf assertion W,(w) is made, then p(a:/h) = w. 

THEOREM 15. If (a, , a2, --- ) is numerically symmetric with respect to h and 
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(b;, be, --- ) #8 numerically symmetric with respect to k and if assertion Q is 
made, then p(ai/h) 2 p(b;/k). 

In the theorems of this section the symmetry rather than the independence 
of the sets of propositions (a; , a2, --- ) appears to be the important notion. 
That such an appearance is deceptive when regard is had to the applications 
(the trials of event E) is realized when the possibility is considered that the 
symmetry of (a, @2,--- ) with respect to h does not guarantee that of (a4, 
Qn42, -+: ) With respect to ha; --- a, : 1.e., the results of the first n trials of 
being known, we may not be in a position to apply our theorems to the others. 
Thus it would appear that the hypothesis of Theorem 4 affording both sym- 
metry and independence is prescribed in such applications. 
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CHAPTER I. INTRODUCTION 


[t is the purpose of the present paper to give a consistent treatment of the 
problems of Markoff’s process and mean ergodic theorem from the standpoint 
of the theory of iteration of bounded linear operations in Banach spaces. 

Let 2 be an abstract space where a measure of Lebesgue type is defined, and 
consider a simple Markoff’s process, by which a moving point ¢ € Q is transferred 
stochastically inside 2. If we denote by P(t, Z) the transition probability that 
a moving point ¢ eQ is transferred into a Borel set # of & after the elapse of a 
unit-time, then we have always P(t, Z) = 0 and P(t, Q) = 1. We shall further 
assume that P(t, Z) is completely additive as a set function of Borel sets £ if 
t is fixed, and that P(t, Z) is Borel measurable in ¢ if EZ is fixed. Under these 
assumptions, the probability P‘” (t, Z) that a moving point ¢ ¢ @ is transferred 
into a Borel set EF of Q after the elapse of n unit-times is given recurrently by 


P(t, E) = [ P(t, ds)P(s, EB), n=23,...; P(t, E) = Pit, B, 
9 


where the integration is of Radon-Stieltjes type. 

The problem of Markoff consists in the investigation of the behavior of the 
sequence of the iterations P(t, E) and their arithmetic means Q(t, EZ) = 
1 n m 
: > , P(t, E) for large n. 

In the special case, when Q is composed of only a finite number (= N) of 
points (with equal measure x), our Markoff’s process is reduced to the classical 
one, and our problem is nothing but the investigation of the behavior of the 
iterations P" of a matrix P = (p;;), i, j = 1,2, --- , N, such that p;; 2 0, 2, 
j=1,2,---,N,and }3}.4 p;; = 1,2 = 1,2, ---,N. This case was first treated 
by A. Markoff in his classical papers, and in the last ten years many contributions 
to this problem were given by J. Hadamard, M. Fréchet, R. von Mises, A. 
Kolmogoroff, W. Doeblin, J. L. Doob and others. These results are all collected 
in a recent monograph of M. Fréchet [3], which is so complete that we find almost 
no need of giving further discussions on this case. We shall only remark that 
this case may be treated exactly in the same manner as in the general case. 

In the general case, when Q is a continuum,’ this problem was discussed by 





1 We shall confine ourselves in this paper only to the case when Q is the closed interval 
0 Sts1. This is not an essential restriction. 
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B. Hostinsky, M. Fréchet, J. L. Doob, W. Doeblin and N. Kryloff-N. Bogolioi- 
boff. B. Hostinsky [1] treated the case when P(t, Z) is given by a continuous 


density p(t, s): P(t, Z) = / p(t, s) ds, where p(t, s) is a continuous function of 
gE 
two variables t and s in Q X Q; M. Fréchet [1], [2] considered the case when 
p(t, s) is bounded and measurable instead of being continuous; J. L. Doob [1] 
weakened this condition by requiring only that p(t, s) is uniformly (in ¢) in- 
tegrable in s, that is, that there exists for any e > 0 an 7 > 0 such that mes(Z) < 
n implies | p(t, s) ds < efor any ¢ e€Q; and lastly, W. Doeblin [1] discussed the 
Ez 


case when the existence of the density p(t, s) is not necessarily assumed. He 
has only assumed that 


there exist an integer d and positive constants b, n such that mes(E) < n 
(D) implies P(t, E) S 1 — b for any t €Q. 


It is clear that the condition (D) of W. Doeblin is more general than that of 
J. L. Doob, and under this condition, W. Doeblin has obtained, by a direct 
(point-set-theoretical) method, a considerably precise result. 

In the present paper, we shall discuss these problems by another method, 
which is due to N. Kryloff-N. Bogoliotboff [1], [2], and which may be called 
an operator-theoretical method. Indeed, we shall treat these problems by 
considering P‘” (t, Z) as a kernel of an integral operator in some special Banach 
spaces. This may be done in two ways: firstly, 


2>T(2) =y: y(E) = [ 2(di) P(t, E) 


is a bounded positive’ linear operation which maps the Banach space (M)° 
into itself; and secondly, 


t—> T(x) =y: y= [ P(t, ds)y(s) 


is a bounded positive linear operation which maps the Banach space (M*)* 





* A bounded linear operation 7’, which maps a semi-ordered Banach space into itself, is 
called to be positive if z = 0 implies T(z) = 0. 

* (M) is the space of all the (real- or complex-valued) completely additive set functions 
2(E) defined for all Borel set E of 2. For any x(E) ¢ (M), its norm is defined by: || z || = 
total variation of | z(Z) | on @. In (M), a real-valued completely additive set function 
x(E) is called to be positive (and is denoted by z = 0), if z(E) = 0 for any Borel set E C Q. 

* (M*) is the space of all the (real- or complex-valued) bounded Borel measurable func- 
tions x(t) defined on @. For any x(t) ¢ (M*), its norm is defined by: || 2 || = l.u.b. | x(t) |. 

€ 


In (M*), two functions, which differ from each other at least at one point, are considered 

to be different; it is to noted that the Banach space (M*) is essentially different from (M), 

where the norm is defined by: || z || = ess. max. | z(¢) |. In (M*), a real-valued bounded 
te 


Borel measurable function x(é) is called to be positive if z(t) = 0 for any ¢ « Q. 
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into itself. If, in particular, P(t, Z) has the density p(t, s), then these operations 
become 


t—> T(x) =y: y(B#) = [ ( [ x(dt)p(t, »)) ds, 


r— T(r) =y: y= [ v s)x(s) ds; 


consequently, in the former case, 7' may also be considered as a bounded positive 
linear operation which maps the Banach space (L)”’ into itself: 


t—T(z) =y: y(s) = [ere s) dt. 


In all these cases, 7 and 7 are both of norm 1, and it will be easily seen that 
the iterates 7” and 7” of 7 and T respectively, which are defined by the iterated 
kernel P(t, E): 


t—> T(z) =y: y(E) = [ s@p° (t, E), 





r> T(x) =y: yf) = [ P (1, ds) (8), 


have also the same properties. Thus our problem is transformed into the in- 
vestigation of the behavior of the iterates 7” of a bounded positive linear opera- 
tion 7’, which maps the Banach space (M), (Z) or (M*) into itself. 

N. Kryloff-N. Bogoliodboff [1], [2] introduced the condition that 


f there exist an integer m and a strongly completely continuous? linear 
\operation V, which maps (M) into itself, such that || T”" — V || <1, 





(K) 


and under this condition they have obtained a remarkable result. In the present 
paper we shall develop this idea of N. Kryloff-N. Bogoliodboff and shall obtain 
a more precise result. This investigation will be carried out in §4, and our 
principal results may be summed up in the following two statements: 

(1) the condition (D) implies the condition (K), 

(2) under the condition (K), all the results of W. Doeblin can be obtained even 

in a more precise form. 

Besides the ergodic kernel (= ensemble final due to W. Doeblin) we shall 
introduce a new notion of ergodic part, and it will be shown that this also plays 
a fundamental réle in the investigation of the asymptotic behavior of the 
sequence P(t, E) for large n. 

In order to obtain these results, we shall develop in §§2 and 3 a general theory 
of iteration of bounded linear operations in Banach spaces. Although the 





5 (L) is, as usual, the space of all the (real- or complex-valued) measurable functions 
z(t) which are absolutely integrable on ©. For any z(t) « (L), its norm is defined by: 


I} 2 || = [ | x(t) | dt, and in (ZL), an integrable function z(t) is called to be positive if we 
2 


have z(t) = 0 almost everywhere in t. 
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iteration of bounded linear operations in concrete Banach space (for example, 
in Hilbert space) was discussed by many authors, such a theory has not hitherto 
been developed in general Banach spaces. We shall show now that this is pos- 
sible under general conditions, and the results thus obtained will find their full 
applications in §4. 

Thus §§2 and 3 may be considered as preparations to §4. Nevertheless, 
these are also the principal chapters of this paper. Indeed, Theorem 1 (mean 
ergodic theorem) and Theorem 4 (uniform ergodic theorem) are among the main 
results of our paper. In order to explain the meaning of these theorems, let 
us recall the mean ergodic theorem of J. v. Neumann: 

Let S be a one-to-one measure-preserving transformation of a space Q into 
itself. Then 

r—>Ta)=y: y(t) = x(S@) 
is a positive unitary transformation which maps the Hilbert space (L’) (defined 
on 2) into itself, and the mean ergodic theorem of J. v. Neumann says that, 


1 
for any x ¢ (L”), the sequence {z,} (n = 1, 2, ---), wherex, = 2 (T+ T°... + 


T")z, converges strongly to some element Z « (L’). 

This theorem may be generalized in two ways: firstly, (L”) may be substituted 
by an arbitrary Banach space (B), and secondly, 7’ may be substituted by an 
arbitrary quasi-weakly completely continuous’ linear operation which maps the 
Banach space (B) into itself. The positiveness of 7 and the existence of the 
inverse operation 7” are both not assumed. This theorem is called the mean 
ergodic theorem in Banach spaces and will be proved in §2.’ 

Moreover, if 7 is quasi-strongly completely continuous,” i.e., if 7’ satisfies 
the condition (K) of N. Kryloff-N. Bogoliodboff, then the strong convergence in 
the mean ergodic theorem may be substituted by the uniform one. This is a 
generalization of the results of M. Fréchet [1], [2], C. Visser [1] and N. Kryloff- 
N. Bogoliowboff [1], [2], and will be called the uniform ergodic theorem in Banach 
spaces. We shall prove this in §3. This theorem is extremely powerful in the 
investigation of concrete Markoff’s process in §4. 

In concluding the introduction, we express our hearty thanks to Yukio Mimura 
and Shékichi Iyanaga for their constant encouragements during the preparation 
of this paper. 

Carrer 2. Mean Ercopic THEOREM IN BANACH SPACES* 


(Generalization of J. v. Neumann’s mean ergodic theorem) 


In this chapter we are concerned with weakly completely continuous and 
quasi-weakly completely continuous linear operations in general (real or com- 





* This notion will be explained at the beginning of §2. 

"Recently, the generalizations of the mean ergodic theorem were given also by N. 
Wiener [1], [2], G. Birkhoff [2] and N. Dunford [1]. We shall discuss these results in §2. 
See the remarks after Theorem 1 of §2. 

*K. Yosida [2], S. Kakutani [1], K. Yosida and S. Kakutani [1]. 
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plex) Banach spaces. By definition, a bounded linear operation 7’, which maps 
a Banach space (B) into itself, is called to be weakly completely continuous if 
it maps the unit sphere || z || < 1 of (B) on a weakly compact set of (B). More 
generally, T' is called to be quasi-weakly completely continuous, if there exist an 
integer m and a weakly completely continuous linear operation V, which maps 


(B) into itself, such that || 7” — V || < 1. 
TuroreM 1. Let T be a bounded linear operation which maps a Banach space 


(B) into itself. Let us further assume that 
(2.1) there exists a constant C such that || T” || S C forn = 1, 2, --- , and that 


for any x ¢ (B), the sequence {x,} (n = 1, 2, ---), where x, = (7 - 
aaa T’ +... + 7'")z, contains a subsequence which converges weakly to a 
point & « (B). 
Under these assumptions, 
(2.3) the sequence {x,} (n = 1, 2, --- ) converges strongly to a point z, 


and if 
we denote by T, the operation x — #, then T, 1s a bounded linear operation which 
maps (B) into itself and 


(2.4) T,.=17T=Ti=7;, |INM|| sc. 


Proor. Let x be an arbitrary point of (B). By (2.2), there exists a subse- 
quence {Zn,} (v = 1, 2,--- ) of {an} (n = 1, 2, --- ) which converges weakly 
to a point < of (B). We shall first prove that we have 


(2.5) T(é) = Z. 
This is an easy consequence of the relation: 


| T@n) — 2a 


aa 1 2 13 +1 1 
(2.6) - |For ++... Oe 4 T+. eee 


_ 1 or |= 
= [Lar — m2] < © 21, 


which is valid for n = 1, 2,---. Indeed, putting n = n, in (2.6) and making 
v tend to , we have (2.5) at once; for {z,,} and {7'(2,,)} converge weakly to 
& and T() respectively. 

We shall next prove (2.3). For this purpose, decompose xz into two parts: 
z=iZ-+ (x — &). Then we have, by (2.5), 


(2.7) m= Et (C++... +) e—2), 


and consequently, in order to prove (2.3), we have only to prove that the 
second term of the right hand side of (2.7) converges strongly to 0 asn — ®. 
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Let the range of the linear operation J — T' (J is the identical transformation) 
be R, and denote its closure (in the strong sense) by R. Since R is a linear sub- 
space of (B), & is also closed in the weak sense. Since the sequence 


{1-i@ ++ ins +7} 


n—-1n 2-3 
n n 








= -N(1+ T+... +1 r)e 


converges weakly to z — &, and since the right hand side clearly belongs to R, 
z - & belongs to R. Consequently, in order to prove (2.3), we have only to 
prove that 


(2.8) x (T[+T’ +... + T")x converges strongly to 0 


for any ye R. This may be performed in two steps: 
Ist case. yeR. Taking a point ze (B) such that y = z — T(z), we have 





[Lotrt... + 2y| =|2@+ r+... +me-7e)| 


=|La@r- re | s 2 i210. 


2nd case. ye. For any e > 0 there exists a point y’ e R such that || y — 
y' || < «. For this y’ we have 


| 


+ [Rot r+ ETM —y’) 











Petre... +ry|s|iotrs..+ rv 





| 


+ Ce. 








< [tot r+ ooo yy’ 


Since the first term of the right hand side converges strongly to 0 asn — «© 
(by the result obtained in the 1st case), we have 





| SCs 








lim (P+ T? +... +T)y 
and, since « > 0 is arbitrary, we have (2.8). Thus (2.8) is proved for any 
ye & and hereby (2.3) is completely proved. 

Let now 7; be the transformation « — #. 17’; is clearly linear and bounded: 


| 7: || $C. Since (2.5) is always true, we have 77, = 7,. Thus T"7; = 
T, and =F +7 +...+ 7")T, = 7, for n = 1, 2, ---, and consequently 


Ti = T;. In order to prove the relation: 7:7 = 71, we have only to start 
from the inequality: 
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[kw+rt wee + T")T (a) ~ (+0 + --+ + 7")z|| s 2 llell 0. 


Since the terms on the left hand side converge strongly to 7;T(zx) and T,(z) 
respectively, we have 7:7'(x) = Ti(x) for any z « (B). 

Thus the proof of Theorem 1 is completed. 

Remark 1. 177) is a projection operator which maps (B) on the proper space 
(B;) of T belonging to the proper value 1.’ That is, 7; does not vanish identi- 
cally if and only if 1 is a proper value of 7’, and 7(z) = x if and only if we have 
T(x) = 2. 

Remark 2. In virtue of (2.1), (2.2) is surely satisfied if 7’ is weakly com- 
pletely continuous. Hence 

Corouuary 1. Let T be a weakly completely continuous linear operation which 
maps a Banach space (B) into itself. If there exists a constant C such that 


|| 7" || < Cforn = 1, 2,--- , then the sequence of operations ‘2 (T+Tl+...4+ 


T")> (n = 1, 2, --- ) converges strongly to a bounded linear operation T; which 


maps (B) into itself and which satisfies (2.4); t.e., mean ergodic theorem is valid 
in (B). 

The conditions of this Corollary are clearly satisfied if (B) is a Hilbert space 
and if T is a unitary operation init. (Since Hilbert space itself is locally weakly 
compact.) Thus Theorem 1 is, in two respects, a generalization of the mean 
ergodic theorem of J. v. Neumann [1], [2]. (See also E. Hopf [1].) In the first 
place, our theorem is true in general complex Banach space, while the proof of 
J. v. Neumann is valid only in the case of a Hilbert space. In the second 
place, we do not assume that 7’ is isometric, and the existence of the inverse 
operation 7” is not required. The positiveness of the operation 7’ is also not 
required. We have only to assume that the conditions (2.1) and (2.2) are satis- 
fied. The first condition (2.1) is always satisfied with C = 1 in the case of 
ergodic theorems and Markoff’s processes; and the second one (2.2) is satisfied 
in every weakly compact Banach space (for example, in regular Banach space 
and especially in (L”) (p > 1)) for any bounded linear operation 7. Conse- 
quently, mean ergodic theorem holds true in (L”) (p > 1) under the trivial con- 
dition (2.1). This result was also obtained independently by F. Riesz [1]. On 
the contrary, in the Banach space (L) = (L’), the condition (2.2) is not always 
satisfied for an arbitrary bounded linear operation T. In order that a bounded 
subset S of (L) be weakly compact, it is (necessary and) sufficient that the 
functions belonging to this subset S are uniformly integrable, that is, that there 
exists for any « > 0 a positive number 6 > 0 such that mes(Z) < 6 implies 





* It is to be noted that in general Banach space (B), there does not necessarily exist, 
for any closed linear subspace (B’), a projection operator (of norm 1 or of finite norm) 
which maps (B) in (B’). 
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[ | x(t) | dt < ¢ for any z(t)¢S. This condition is surely satisfied if there 
gE 


exists a function 2o(t) ¢ (LZ) such that |x| S 2 (that is, | z(t) | S 2(¢) almost 
everywhere in t) for any z(t) ¢ S. Hence we have 

Corotuary 2. Let T be a bounded linear operation which maps the Banach 
space (L) into itself. If there exists a constant C such that || T" || S C forn = 
1, 2,---, and af there exists, for any x(t) €(L), a function x(t) « (L) such that 


T"(x) S Xo (or, more generally, (T+T’+.--+T7")rs ta) forn =1,2,.--, 


then for any x(t) € (L) the sequence (2 (T++...+4+ rx} (n = 1,2,---) 
converges strongly to some function Z(t) € (L); t.e., mean ergodic theorem is valid 
in (L). 

In order to prove this, we have only to notice that there exists also an 24 
such that 7"(— xz) S amie. —2) S T"(z) forn = 1,2,.--. 

This theorem was proved by F. Riesz [2] and an analogous theorem was also 
stated by Garrett Birkhoff [1] for abstract (L)-spaces. The result of G. Birk- 
hoff is more general in formulation, but it was pointed out that in essential the 
two theorems are equivalent.” 

Moreover, if 7 is an integral operator with the probability density kernel 


p(t, 8): 


z— T(x) = y: y(s) = [ sons, s) dt 


(p(t, s) 20 for any t, s; [ p(t, s)ds = 1 for any 2d), 
2 


then a (necessary and) sufficient condition that 7’ be weakly completely con- 
tinuous is that p(t, s) is uniformly (in é) integrable in s, that is, that there exists 


for any « > 0 a positive number 6 > O such that mes(#) < 6 implies i p(t, 
Ez 


s)ds < efor any t. This condition was investigated by J. L. Doob [1] without 
being noticed that the linear operation 7’ becomes weakly completely continuous 
under this condition. We have once” treated this case of the condition of J. L. 
Doob as an application of the mean ergodic theorem. But, on looking precisely 
into the detail of the fact, we have found that, in this case, the linear operation 
T satisfies even the (in some sense stronger) condition (K) of N. Kryloff-N. 
Bogoliotboff, and that the uniform ergodic theorem is true in this case. (In- 
deed, the condition (K) of N. Kryloff-N. Bogoliowboff follows from the condition 
(D) of W. Doeblin, which is weaker than that of J. L. Doob.)” 

Remark 3. Recently Garrett Birkhoff [2] proved that in every uniformly 





©S. Kakutani [2]. 
"' K. Yosida and 8. Kakutani [1]. 
® K. Yosida [3]. 
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convex Banach space mean ergodic theorem is valid for any bounded linear 
operation of norm not exceeding unity. This is also a generalization of the 
mean ergodic theorem of J. v. Neumann, since Hilbert space is uniformly con- 
vex. But this result of G Birkhoff is contained in our Theorem 1; for, every 
uniformly convex Banach space is regular and consequently locally weakly 
compact.” (It is, however, to be remarked that the proof of G. Birkhoff dis- 
tinguishes itself by its extreme simplicity.) 

Remark 4. We can also prove the theorem of the following type: 

Corouiary 3. Let S and T be two bounded linear operations which map a 
Banach space (B) into itself, and assume that these are commutative between them- 
selves: ST = TS. If there exists a constant C such that || T” || S C and || 8S” || s 
C forn = 1,2, --- , and if, for any x € (B), the sequence {mn} (m,n = 1,2, ---), 
where 


tm = (S+S $+ +SHVT+ T+. +z, 


contains a subsequence {Xm,n,} (mM,—> ©, n,—> ©) which converges weakly to a 
point & of (B), then we have lim Xm, = & strongly for any x e (B); and tf we denote 


m™,n—->o 


the transformation x — & by U, then U is a bounded linear operation which maps 
(B) into itself and we have 


US =SU=UT=TU=U'=U and |[U|| <C. 


Proor. It will be easily seen (exactly as in the proof of Theorem 1) that we 
have S(z) = T(%) = #. Hence we have only to prove that the sequence 
(S+84+.---+S8)(T+P4+---+T)(e- a} converges strongly 


to 0 asm— « and n— o simultaneously. In order to prove this, denote by 
R the set of all the points z ¢ (B) of the form: z = (x — S(x)) + (y — T(y)) 
with z, ye (B). R is clearly a linear subspace of (B) and x — # belongs to its 
strong (= weak) closure R; for, we have 


-Lets+.. +S"\(T+T7? +... +7} 


= -LG+st.. +5")}2 


+2 6ts4..+8 


1 , , \ 
a SHS H+ SMT ET +. +P) 





1% §. Kakutani [3]. See also D. Milman [1]. B. J. Pettis {1}. 
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ear = -9(1+ 184 ™Fs 4... +19) 
the m m m 
sa l(s+s Ss") 
“dl + (S+S'+--- +S") - 7) 
kly 
lis- (r42itr etry. tho) 
n n n 
and the left hand side converges weakly to x — ~. Consequently, we have only 
D a 
m- to prove that tins (S+S+---+ 8S\(T+T4+.-.4+ r")z) converges 
s strongly to 0 for any ze R. This is clear if z eR, and the general case ze R 
) may be treated in exactly the same manner as in the proof of Theorem 1. 
Mean ergodic theorem of this type was first discussed by N. Wiener [1], [2]. 
N. Wiener treated the case when the inverse operators S and 7" both exist, 
and has also obtained the individual ergodic theorem of G. D. Birkhoff’s type. 
Recently N. Dunford [1] has also announced the theorem of the same kind. 
a THEOREM 2. Under the same assumptions as in Theorem 1, we have: for any 
ote complex number of absolute value 1, there exists a bounded linear operation T) , 
se which maps (B) into itself, such that 
Ve ae x T” 
(2.9) z « + 3 +... + r) converges strongly to T,, 
(2.10) T.T=TT=d7,, Tr=Ts, (||Trll SC, 
we 
0 (2.11) \#yu implies T,T, = 0, 
ly (2.12) '. ¥ 0 af and only if d is a proper value of T. 
Oy In this case, T, is a projection operator which maps (B) on the proper space (B,) 
)) of T belonging to the proper value \, and T(x) = x if and only if we have T(x) = 
, » k 
ts Ax. If we further put T’ = T — DAT for any system {\, 2, --- , Ax} of 
i=1 
complex numbers of modulus 1 with \; # ; (i ¥ j) and T,, # 0, then we have 
(2.13) T,,T’ = T'T,, =0 for ~=1,2,---,k, 
(2.14) TT’ = T'T =T”, 
k 
(2.15) T= NTN, +7" forn =1,2,---, 
i=1 
(2.16) { \ is a proper value of T’ if and only if it is a proper value of T and 
\A ¥ Xi fort = 1,2, --- ,k. 
t 


: ROOF. (2.9) is clear from Theorem 1 if we consider 7'/) instead of 7’. From 
this (2.10) follows at once. To prove (2.11), multiply (2.9) by 7, from the 
right. Then we have, by (2.10), 
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ee nad 














KOSAKU YOSIDA AND SHIZUO KAKUTANI 


2 n 
u (¢ ee “) T, converges strongly to 7, T,. 
n\xX \" 
Since, in case \ ¥ yu, the left hand side converges strongly (even uniformly) to 
a zero operator, we have (2.11). (2.12) is clear from the Remark 1 to Theorem 
1. (2.13), (2.14) and (2.15) may also be proved by easy calculations. To 
prove lastly (2.16), let 4, || # 0, be a proper value of 7 and assume that 
} ~ i; fori = 1, 2,---,k. Then there exists a point 2 ¢ (B) (1% ¥ 0) such 
that Tao) = Ax. Therefore, multiplying the both sides by 7, from the left, 
we have, by (2.10), \:7y,;(#0) = AT, (Zo) and consequently (since d 9 Xi we have 


Ty, (%) = O for? = 1, 2, --- ,k, so that we have Ax = T(x) = : ® AiT, (to) + 
i=1 


T’(2) = T’(ao). Hence dis a proper value of 7”. Conversely, let \ be a proper 
value of T’. Then there exists a point 2% €(B) (a # 0) such that 7’(a) = 


AZ. Since we have T(x) = (; r (x) = SIT" (%) = oT *(ao) (by (2.14)) = 
AX , X is also a proper value of 7. In order to prove that A ¥ A; for 7 = 1, 2, 


. , k, assume that \ = \; for some z. Then we have 


2 n 
(747 4 dine +r) a= forn = 1,2,..- 
n ri Aji Xj 


and consequently (by (2.9)) 7,,;(2%0) = 2% # 0, which leads to a contradiction, 


since T,,(2e) = Tr, (; v"()) - > Ta 1" (a) = 0 by (2.13). 


The proof of Theorem 2 is hereby completed. 

THEOREM 3. In Theorems 1 and 2,the condition (2.2) may be substituted by 
the condition that T is quasi-weakly completely continuous. 

Proor. We have only to prove that, under the condition (2.1) and the as- 
sumption that 7 is quasi-weakly completely continuous, there exists for any 
x e (B) a subsequence {z,,} (v = 1, 2, --- ) of {x,} (n = 1, 2, --- ) which con- 
verges weakly to a point t¢«(B). In order to prove this, put T” = V + D 
with || D|| = a < 1. Then we have 
(2.17) T’" = V,+ D’, 
where V, = 7?” — (T” — V)’ is weakly completely continuous with Vi = V 
and || D’ || < a.“ Hence, putting 





“ If V and W are weakly completely continuous, then V + W, 7'V and VT are also weakly 
completely continuous for any bounded linear operation T. Thus the totality of all the 
weakly completely continuous linear operations constitutes an ideal in the ring of all 
bounded linear operations. Expanding T7™ — (T™ — V)?, the term T?™ vanishes and there 
remain only those terms (finite in number) which contain at least one V-factor. Con- 
sequently V, is weakly completely continuous. 

It is to be remarked that the same is also true for strongly completely continuous linear 
operations. 








y) to 
‘orem 
To 
that 
such 
left, 
have 


0) + 
“oper 
) = 
)) = 
1, 2, 


Lion, 


1 by 


» aS- 
any 
0n- 
- D 


ikly 
the 

all 
ere 
‘on- 


ear 





TREATMENT OF MARKOFF’S PROCESS 199 


Zam a * (Tr + T+ ile + T™)x 
for brevity’s sake, we have for n > pm 


Ce ee 


(T+ Tp. Tet l OMT ET +. $s 


si— 


= In,pm + ae = Ln,pm + V p(en,n—pn) + } | og | on 


Since || Zn,n—pm || & C-|| 2 || for any n > pm (by (2.1)), there exists, for any p, 
a subsequence {n,} (vy = 1,2,---) of {m} (m = 1,2,---) such that 
\Vn(2n,.n,-pm)} (v = 1, 2,--- ) converges weakly to a point Z«(B). Con- 
sequently we have 

Tim | f(x»,) — f@p) | S lima |f(@n,.pm) | + lim | f(D? (rn, .»,~pm)) | 


yoo 


for any bounded linear functional f(x) defined on (B). Since 





|S enyom) | SFU tarom|l S liFi|-F2-€ ll || — 0, 
and 
[MDP eay.nyvm)) |S [SMI DP ill 2o,.ae—om ll SS Il “2? Cel 2 Ih | 
we have ) 
(2.18) lim | f(tn,) — f@p) | S || f||-<?-C || || 


vo 


for any bounded linear functional f(x) defined cn (B). 

Applying the diagonal method, we may assume that (2.18) holds for any 
bounded linear functional f(z) defined on (B) and for p = 1, 2,---. Of course, 
t, may depend of p. Let us now consider the sequence {Zp} (p = 1, 2, --- ). 
From (2.18) we have 


| f(E») — f(%) | S (e” + a%)-|/F || C || 2 II, 
and since f(z) is an arbitrary bounded linear functional defined on (B), we have 


|| > — _ || S (a + a%)-C.|| x || for any pand g. Consequently, (since a < 1), 
itp} (p = 1, 2, --- ) is a fundamental sequence in (B). If we put = = lim Z,, 


p-co 





then we have, for any p, (from (2.18)) 


lim | f(@s,) — f(@)| $ lim |f(2s,) — f@») | + |fG) — f@| 


vo 


lA 


If ll-a?-C ll x{] + If ll-ll@, — #]I.- 
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Since p is arbitrary and since the right hand side tends to 0 as p — ©, we have 
lim | f(2n,) — f(@) | = 0. Since f(x) is an arbitrary bounded linear functional 


yoo 


defined on (B), we have thus proved that the sequence {%n,} (v = 1, 2, ...) 
converges weakly to . The proof of Theorem 3 is hereby completed. 
Remark. All what we have obtained in Theorems 1 and 2 is also valid for 
quasi-weakly completely continuous linear operations. We can also prove 
several propositions concerning the conjugate operators of such bounded linear 
operations.” But we shall not go into the detail, which is not essential in the 
further discussions of this paper, and we shall proceed to the case of strongly 
completely continuous and quasi-strongly completely continuous linear 


operations. 
Cyaprer 3. Unirorm Ercopic THEOREM IN BANAcH Spaczs”® 


(Generalization of the theorems of Fréchet-Kryloff-Bogoliodboff) 


In this chapter we shall discuss strongly completely continuous and quasi- 
strongly completely continuous linear operations. By definition, a bounded 
linear operation 7’ of a Banach space (B) into itself is called to be strongly 
completely continuous if it maps the unit sphere || x || S 1 of (B) on a strongly 
compact set of (B). More generally, 7 is called to be quasi-strongly completely 
continuous, if there exist an integer m and a strongly completely continuous linear 
operation V, which maps (B) into itself, such that || 7” — V || < 1. 

Since every strongly completely continuous linear operation is .a fortiori 
weakly completely continuous, all what we have obtained in §2 is also valid 
for the case of strongly completely continuous or quasi-strongly completely 
continuous linear operations. Moreover, we can show that in the present case, 
the strong convergence in Theorems 1, 2 and 3 may be substituted by the 
uniform one. These are, as will be seen from the proofs below, the consequences 
of a theorem of F. Riesz and its generalizations to the case of quasi-strongly 
completely continuous linear operations (Lemmas 3.1 and 3.2). 

THEorEM 4. Let T be a strongly completely continuous or quasi-strongly com- 
pletely continuous linear operation which maps a Banach space (B) into itself. If 
there exists a constant C such that || T" || < C forn = 1, 2, --- , then the proper 
values of T of modulus 1 (if such proper value ever exists) are finite in number 
and each of them is of finite multiplicity. Let us denote these by \1, \2, --- » \- 
Then there exists a system of strongly completely continuous linear operations 
Ty, , Trg, +--+, Tr, (each of them ¥ 0), and a strongly completely continuous or 
quasi-strongly completely continuous linear operation S ( which might vanish), such 
that T" is decomposed into the form: 


k 
(3.1) T= DIM, +S, n=1,2,---, 


i=1 





16K. Yosida and S. Kakutani [1], K. Yosida [4]. 
°K. Yosida [1], [4], S. Kakutani [1]. 
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with 
TT, = 1; T =MTi,, Ty, =T1,, Th, T),; = 0 (i ¥ 9), 
3.2 
(3.2) T,,8 =S%,=0, ||Tx || SC, t=1,2,.--,k, 
and 
M 

s —— = bee 

(3.3) |S” || = (1 + .«)”’ n 1, 2, ’ 


where M and «¢ are positive constants which are independent of n. 

Remark. Theorem 4is a generalization of the results of C. Visser, M. Fréchet 
and N. Kryloff-N. Bogoliotboff. C. Visser [1] discussed the iteration of strongly 
completely continuous linear operations in Hilbert space, and has obtained 
only strong convergence. M. Fréchet [1], [2] discussed the case of an integral 
operator 7’ with bounded density kernel p(t, s): 


r>T(2) =y: y(s) = [ x(t)p(t, 8) dt, 


and has obtained a considerably precise result after somewhat long calculations. 
It was, however, shown by Y. Mimura and the authors (Lemma 4.5 in §4) that, 
for such integral operation 7’ with bounded density kernel, the second iterate 
7” already becomes strongly completely continuous as an operator which maps 
the Banach space (ZL) into itself. Hence the theorem of M. Fréchet may be 
considered as a special case of Theorem 4. N. Kryloff-N. Bogoliodboff [1], [2] 
treated the case of an integral operator with general probability kernel P(t, £), 
which maps the Banach space (M) into itself: 


2 T(z) =y: y(E) = [ x(dt) P(t, B), 


and have obtained the same results (they announced these without proof) under 
the same conditions as in Theorem 4. It is, however, to be noted, that Theorem 
4 holds true even for the general case when the Banach space (B) is arbitrary 
and when the positiveness of the operation 7 is not assumed. The more de- 
tailed discussions of such operations and their iterations, where the condition 
of positiveness plays its essential réle, will be fully developed in §4. 

In order to prove Theorem 4, we shall first prove two Lemmas which may be 
considered as generalizations of the results of F. Riesz [1]. 

Lemma 3.1. If T is quasi-strongly completely continuous, then the proper 
values of T do not accumulate to a point not interior to the unit circle || = 1 of 
the complex plane. 

Proor. Take an integer p so large that we have a’ < } where 7” = V + D, 
||D|| =a<1. Wehave (exactly as we have obtained (2.17) in §2) 


(3.4) 7" =V,+D", ||D’|| Ss a’ <}, 


where V, is a strongly completely continuous linear operation which maps (B) 
into itself. From this it will be easily seen that we have only to consider the 
case when m = 1 anda < }, 
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In order to prove Lemma 3.1 in this case, let {An} (m = 1, 2, --- , An ¥ Ay 
(n * m)) be a sequence of proper values of 7’ which converges to a point \, 
of modulus not smaller than 1. We shall deduce a contradiction from these 
assumptions. For this purpose, let {z,} (nm = 1, 2, --- ) be a proper element of 
T corresponding to the proper value A, : 7'(%n) = Ankn, In ~ ON = 1,2,..., 
We shall first prove that these x, are mutually linearly independent. This 
may be done by mathematical induction. Let 2, 22, ---, 2n-1 be already 
linearly independent and assume that x, depends linearly on 2 , 22, +--+ , 2,1: 


n—1 


t, = > jr ai;. Then we have 


2 n—l n—1l 


i=] i=1 i=1 
Since {a;} (¢ = 1, 2, --- ,m — 1) do not vanish simultaneously, this is a con- 
tradiction to the assumption that 2, 2%,---,2%n,-1 are mutually linearly 
independent. 

Thus we have proved that z, (n = 1, 2,--- ) are mutually linearly inde- 
pendent. Consequently, the (n — 1)-dimensional subspace (X,-1) of (B), 
which is spanned by 21, 22, --- , %n-1, is a true subspace of the linear space 
(X,) which is spanned by 2, 22, ---,2%n. Hence, by a well-known theorem 


of F. Riesz [1], there exists a sequence of points {y,} (n = 1, 2, --- ) such that 
yn € (Xn), || yn || = land || y, — x|| > 4 for any xe (X,_1). Since each y, is 
of the form: yn = shu ait; , we have 
n—l 
-r(¥ is) = > ati — a = ~ P(x) = > « ‘(1 ad a) Ti € (Xn-1) 
t=1 imi A i=] n 
and 


r(¥ te) = Yr nm) = Sas se (Xm) 


i=1 Am 


for any n and m. Consequently, the trivial identity: 


ial ' (+) am 1 (t=) = Yn — {(u- - r(%)) + r(¥)} 
oo 70) 


\>4 for n> ™m. 





On the ay hand, since V is strongly completely continuous and since the 


(n = 1, 2, --- ) is uniformly bounded, there exist two sequences 


r()- 


= 0. This is, however, a contradiction with (3.5), since we have by 


sequence - 





of integers {n,} and {m,} (n, > m,, v = 1, 2, ---) such that lim 


Ym, 
" (i=) 


assumption 
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{r(t-) - 2 (te-)} - {v (Hes) - v (He) } 

| (%)-v(%)|+|7()-v()| 
>) + >) s<(EI+ HE) <4 


|< An, | | No 

Thus Lemma 3.1 is completely proved. 

Lemma 3.2. If T is quasi-strongly completely continuous, then the proper 
space (By) of T belonging to a proper value d of T' of modulus | is of finite dimension. 

Proor. This may be done in exactly the same manner as in the proof of 
Lemma 3.1. We may again assume that m = 1 anda < } 

If, for some proper value \ with |\| = 1, the proper space (B,) is not of 
finite dimension, then there exists a sequence of points z, € (B,) such that 
an || = 1, T(an) = Arp(n = 1, 2, --- ) and || 22 — 2m|| > 3 forn > m. 

On the other hand, since V is strongly completely continuous, there exist two 























lA 





I} 


Ym, 
| Am, 





























sequences {n,} and {m,} (n, > m,, v = 1, 2, --- ) such that lim || V(z,,) — 
V(rm,) || = 0. This will, however, lead to a contradiction since we have 
3 < || tn, r Im, || = | A(Xn, Im,) || -_ || T (Xn,) _— T(2m,) | 


S || V(en,) — V(am,) || + || D@n,) — D(m,) || 
S || V@n,) — V(aem,) || + 2a. 


The proof of Lemma 3.2 is hereby completed. 

Proor or THEOREM 4. Now, in order to prove Theorem 4, let us recall 
that Theorem 3 is valid in our case. From Lemma 3.1, there exists only a finite 
number of proper values \ of 7’ on the unit circle |\| = 1. Let us denote 
these proper values by 1, Ax, --- , Ax. Then there exists (by Theorem 2) for 
each \; a bounded linear operation 7), , which maps (B) on the proper space 
(B,,) of T belonging to the proper value \;. 7), satisfies all the conditions of 
Theorem 2 and is strongly completely continuous since each (B,;) is of finite 
dimension by Lemma 3.2. Let us put 


(3.6) =T- > Th, 

and consider the bounded linear operation S thus defined. We shall prove that 
this S is also quasi-strongly completely continuous and that its iterations S" 
are uniformly bounded. This is an easy consequence of the relation S" = 
T — ia APT) ;, which follows immediately from (3.6) and the result of Theorem 
2. Indeed, there exists a constant C’ = C + )oi-1 || he || S (k + 1)C such that 
|S" |] S|] 7" |] + Sha || Ty, || S C’ for n = 1, 2, --- , and also an ae m 
and a strongly completely continuous linear squiuiin Vv’ =V-—- int x? 
such that || S" — V’|| = || 7" — V|| <1. 
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Moreover, S has no proper value of modulus 1, so that in order to complete 
the proof of Theorem 4, we have only to prove the following 

Lema 3.3. If, in addition to the assumptions in Theorem 4, T has no proper 
value of modulus 1, then there exist two constants M and ¢ > 0 such that 

jai M ~ 
(3.7) \|7" || s ato for n=1,2,...- 

Proor. It is sufficient to prove the case m = 1. For, if Lemma 3.3 is true 
for the case m = 1, then there exist two constants M’ and é’ > 0 such that 
ir" || < M’/(1 + &)" for n = 1, 2, --- , and it is easy to deduce from this 
the existence of two constants M and e > 0 which satisfy (3.7). 

Now, in order to prove Lemma 3.3 in the case m = 1, put 7 = V + D, where 
V is strongly completely continuous and || D|| = a < 1. By a well-known 
result, J — \D (J is the identical transformation) admits a unique inverse J + 


AD(d) = I + DO%-1 A"D" which is regular in || < , and we have (I + AD(d)) 

(I — AT) = (I+ AD(A))U — AV — AD) = I — AV — NDA)V = 1 - VA), 

where V(A) = AV + A’D(A)V is regular in ) for | \ | < a and is strongly com- 
a 


pletely continuous for each \ with |A| < > By Lemma 3.1, there exists a 
a 


positive number 7 > 0 such that 7’ has no proper value \ in 1 — 7 < |A| < 


1+. Put 2e= min (: — 1, ") and consider the domain A: 1 — 2e < |A| < 
Qa 


1 + 2e. The equation (I — V(A))z = 0 has no non-trivial solution x ¥ 0 for 
each ¢€A. For, if there exists an 2%) ¥ 0 with (I — V(A))ao = 0, then we have 
(I — AT) = (I — »D)(I — V(A))ao = 0, and this is a contradiction since T 
has no proper value in A. Consequently, since V(A) is strongly completely 
continuous for any \ «A, there exists, by a theorem of F. Riesz, a unique in- 
verse J — K(d) of I — V(A) for any ) €A; and it will be easily seen that I — 
AR(A) = (I — K(A))Z + AD(A)) is an inverse of I — AT for each d € A. 

Thus we have proved that J — AT hasan inverse for any \ «A. Since it is 
clear from the uniform boundedness of {7'"} (n = 1, 2,---) that J — AT has 
an inverse I + }°%..\"7"" for any \ with || < 1, we have thus proved the 
existence of an inverse (I — \T)~ for any \ with |\ | < 1+ 2e. Consequently, 
by a theorem of M. Nagumo [1], (J — 7)“ is regular in \ for | A | < 1 + 2€ 
and the series of C. Neumann: (J — XT) = I + }°%_, "7" converges in the 
uniform sense in |\| < 1 + 2e. Hence there exists a constant M such that 
the inequality (3.7) is valid for n = 1,2,---. i 

The proof of Lemma 3.3 and herewith the proof of Theorem 4 are completed. 

Corotuary. Under the same assumptions as in Theorem 4 we have: 

(i) For any complex number d with || = 1, there exists a strongly completely 
continuous linear operation T, , which maps (B) into itself, such that 
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if? ot ss || ~ M 
oe ain <m cece — — Qik gs = eee 
‘e ( + 2 + + r=) Ty | ae for n 1, 2, ’ 





where M is a constant which is independent of n, and T) does not vanish identically 
if and only if 1s @ proper value of T. 

(ii) In order that the sequence {T"} (n = 1, 2,---) converge in the uniform 
sense to a zero operation, it 7s necessary and sufficient that T has no proper value 
of absolute value 1. 

(iii) In order that the sequence {T"} (n = 1, 2, --- ) converge in the uniform 
sense to a bounded linear operation T, ¥ 0, it is necessary and sufficient that 1 
is a proper value of T and that T has no other proper value of absolute value 1. 

In (ii) and in (iii), if the sequence {T"} (n = 1, 2, --- ) converges in the uniform 
sense, then it ts of the order of geometrical progression; that is, there exist a constant 
M and a positive number ¢ both independent of n, such that we have || T” || < 
M/(1 + ©)” and || T" — 7, || S M/(1 + ©)" respectively forn = 1,2, ---. 


Cuaprer 4. MarkKorr’s Process” 


§4.1. Introduction. Let us denote by P(t, Z) the transition probability that 
a point ¢ of the unit interval 2 = (0, 1) is transferred, by a simple Markoff’s 
process, into a Borel set E of © after the elapse of a unit-time. Wehave always 
P(t, EZ) = 0 and P(t, 2) = 1. We shall assume that P(t, £) is completely 
additive for Borel sets E if ¢ is fixed, and that P(t, HZ) is Borel measurable in ¢ 
if E is fixed. Then the transition probability P‘”(t, E) that a point t eQ is 
transferred into a Borel set EF of Q after the elapse of n unit-times is given 
recurrently by 


P(t, Z) = [ P°-%(, ds)P(s, E) = [ PuayP"G, B), 
2 2 


(4.1) a = 28, .-. 
| P(t, E) = Plt, B), 
where the integration is of Radon-Stieltjes type. 

Consider the complex Banach space (M) of the complex-valued com- 
pletely additive set functions x(E£) defined for all Borel set EF of 2. For any 
«(Z) «(M), its norm is defined by: || z|| = total variation of | z(EZ) | on Q. 
Then we have 

Lemma 4.1. The integral operator 


(4.2) meer ee [ 2(dt) P(t, E) 


is a bounded linear operation which maps the Banach space (M) into itself and 
| 7 || = 1. 





7 K, Yosida and S. Kakutani [1], K. Yosida [3], S. Kakutani [4]. 
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On the other hand, if we consider the complex Banach space (M*) of all the 
complex-valued bounded Borel measurable functions x(t) defined on Q, with 
|| « || = Lu.b. | 2(é) | as its norm, then we have 

te 


Lemma 4.2. The integral operator 
(4.3) t—> T(z) =y: y® = [PG ds)x(s) 


is a bounded linear operation which maps the Banach space (M*) into itself and 


| 7 || = 1. 
Lemna 4.3. For any x(E£) € (M) and y(t) « (M*), we have 


(4.4) [ xa ([PG avy) = [ nada, 


cin adin « [ x(dt)P(t, E). 


These three Lemmas are clear from the properties of P(t, £). 

Remark. By virtue of Lemmas 4.1 and 4.2, P‘” (t, E) can be defined recur- 
rently by (4.1). Hence P(t, E) is completely additive for Borel sets E if t 
is fixed, and P‘”(t, Z) is Borel measurable in ¢ if E is fixed. Clearly we have 


(4.5) P”(i,ZE) 20, P(t, 2) = 1, n=1,2,..-. 


Moreover, by the repeated use of Lemma 4.3, we have 


(4.6) Pp" (, B) = / P™ (t, ds)P (s, E) 
Q 


for any m and n, and it will be easily seen that the iterated operators 7” and 7” 
are defined by the kernel P‘”(t, Z). We have clearly 


It is now the purpose of this chapter to investigate the asymptotic behavior 
of the sequence {P‘”(t, Z)} for large n. We shall treat this problem by con- 
sidering P‘”’(t, E) as the kernel of the integral operators 7” and 7”. Since the 
Banach spaces (M) and (M*) are not conjugate to each other, these two operators 
T” and 7” are not the conjugate operators to each other in the strict sense 
which was given by S. Banach [1]. But in essential, these play the same rdle. 

Our problem is not quite easy if we have no further assumptions on the 
kernel P(t, Z). Our fundamental assumptions are the conditions (D) and (K) 
which were stated in §1. The first condition (D), which is due to W. Doeblin 
[1] is more general than those given by B. Hostinsky, M. Fréchet and J. L. Doob. 
The second one (K) is due to N. Kryloff-N. Bogoliowboff [1], [2] and was intro- 
duced by them independently of W. Doeblin. We shall show (§4.7) that the 
condition (D) implies (K), and under the condition (K) all the results of 
W. Doeblin will be obtained in a more precise form (§§4.2—4.6). Our principal 
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results are stated in Theorems 5-12. Theorem 5 is a restatement of the uni- 
form ergodic theorem (Theorem 4 of §3), and this is a starting point of all 
the discussions of this chapter. Among other theorems, Theorem 6 is to be 
noticed. The formula (4.23) will show how the notions of the Banach spaces 
(M) and (M*) are essential in our problems. As a corollary to Theorem 6, 
we shall obtain the new notion of ergodic parts (Theorem 7); and the decom- 
position of 2 into ergodic kernels (= “ensembles finals” of W. Doeblin) and the 
dissipative part (Theorem 8) is also a direct consequence of Theorem 6. More- 
over, using the fact that under the condition (K) each proper value of 7 of 
modulus 1 is a root of unity (Theorem 9), the subdivision of the ergodic parts 
(and ergodic kernels) into cyclic parts will be easily obtained (Theorem 11 and 
its Corollary). 

The classical results concerning the Markoff’s process with a finite number 
of possible states may be easily obtained from our Theorems. In order to 
obtain these results, we have only to take a kernel P(t, E) of the special type. 
This will be carried out in §4.8. In this way, the hitherto known results con- 
cerning the Markoff’s process with a finite number or a continuum of possible 
states are obtained in a more precise form by a unified method. 


§4.2. Spectral decomposition of P‘”’(t, Z) under the condition (K). 
Theorem 5. Under the condition (K), P‘” (t, E) is decomposed into the form: 


k 
(4.8) P(t, FE) = UMP AE +S BE), n=1,2,-6, 
i=1 
where {di} (¢ = 1, 2, --- , k) are the proper values of T of modulus 1, and 
n (m) 
(4.9) l.u.b. > r= — Py,,@, £)| s se, 
$62,802 | m=1 ‘i n 





(4.10) [ P™ (t, ds)Py,(s, E) = [ P,,(t, ds)P (s, E) = 3 Py, (t, B), 


(4.11) [ Py, (t, ds)P,,(s, H) = Py,(t, Z) or 0 according as i = j ort # j, 


(4.12) [ Put 086, B = [ se dP, B = 0, 
2 2 
M 
4.13 Lub. | S(t © trier, 
( ) sa GHIS are 
t= 1,2,.--,k;n = 1, 2,--- , where M and e are positive constants which are 


independent of n. 

Proor. This theorem follows directly from Theorem 4 (uniform ergodic 
theorem) of §3. We have only to notice that, by Theorem 4, only the decom- 
position of the operators 7" is given and that the decomposition of the kernels 
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P(t, E) is not yet obtained. But, since the convergence him = >> T"/f = 


T,, is uniform, the decomposition of the kernels is leealiaatealie obtained. 
Remark. The uniform limit 


1s PG _ pap) 


no 1 m=1 ——— 











exists for any \ with |\| = 1. By putting A = 1 and remembering (4.5), we 
see that P,(t, Z) is not identically zero. Hence \ = 1 is a proper value of 7. 
If we put A; = 1, then the integral operator 7; defined by the kernel 














(4.14) P.(t, E) = lim | Fs P™(t, E) (uniform limit) 


no 2 m=1 





is a projection operator, which maps the Banach space (M) on the proper 
space of 7 belonging to the proper value 1. More precisely, we have 


[T+ +... 47 M 
n 











eed = cee 
(4.15) ~ a=, nm =1,2,.--, 








i 3 with a positive constant M. This is a result of N. Kryloff-N. Bogoliotboff. 
j | (The existence of the mean sojourn.) Moreover, the integral operators 7), 
pf (¢ = 1, 2,---,k) defined by the kernels P,,(t, FZ) are all strongly completely 
4 continuous. 












§4.3. Structure of the kernel P,(/, EF). By (4.5) and (4.14). we have 
(4.16) P,(t, E) = 0, P,(t, 2) = 1 










and the proper value equation in (M): 





(4.17) A a athe [ 2(dt) P(t, E) 





admits a non-trivial solution z ~ 0. In fact, by (4.10) of Theorem 5, for any 
by € 2, Pi(to, E£) is a solution of (4.17): 






(4.18) Pi(, E) = [ Pi(to, dt)P(t, E), 





and P(t, £) is not identically zero. 

In this section, we shall study the general form of such a solution of (4.17). 
and using the results thus obtained, the structure of the kernel P,(t, E) will be 
determined. 

In order to make our discussions clearer, we shall make use of some elementary 
notions from the theory of semi-ordered Banach spaces. A completely additive 
real-valued set function x(E£) ¢ (M) is called to be positive and is denoted by 
x 2 0, if we have x(Z) = 0 for any Borel set E of Q; and for any pair of real- 
valued set functions x(Z), y(EZ) « (M), denote by xz = y the relation z — y = 0. 
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Then the relation z = y determines a semi-ordering of the real Banach space (M). 
For any real-valued set function z(E) ¢« (M), its total variation Z(Z) is a posi- 


tive element of (M), and if we put 
2,(Z) = 4@(E) +2(Z)), 2(Z)=43@(Z) — x(Z)), 


then these are also the positive elements of (M). These are called the positive 
and the negative variations of x (on £) respectively. We have x = 24 — 2_, 
% = a, + z~ and, from the trivial relation: (—z), = z_, we have 
x— (x —y)+ =y — (y—2x)4. This last common element is called the mini- 
mum of two elements x and y, and is denoted by z A y. The special case 
xz A y = O requires our special attention. It is to be remarked that for two 
positive elements x and y of (M), x A y = 0 is equivalent to the condition that 
there exist two disjoint Borel sets Z, and EH» of 2 such that z(#,) = 2z(Q) and 
y(E2) = y(Q). This fact is needed in the following discussions. _ It is also to be 


noted that we have always 71 A x = 0, (x — y)i A (y — 2)y = O and 
(4.19) (x— (AY) AY-@AY)= 


Lemma 4.4. If x and y are two real-valued solutions of (4.17), then %, x4 , 2 


and x A y are also solutions of (4.17). 
Proor. It is sufficient to prove this for . Since P(t, #) 2 0 for any Borel 


set E C Q, we have 
2H) < [ 2(@)PU,B), 
2 


and, since P(t, 2) = 1, here must stand the equal sign. 
LemMA 4.5. There exists a system of completely additive set functions {z—(B)} 


(a= 1,2, , 1) « (M), with the properties: 
(4.20) T(tea)=2a, a2 0, (2) =1, ta A t= (a ¥ 8), 
such that any x(E) « (M) which satisfies  ~ 
(4.21) Tiz)=2, 220, 2Q)= 
is uniquely expressed as a linear combination: 
l U 
(4.29) 2(E) = >) cata(E), Ca 20; Dc = 1 
a= a=1 
Proor. Let 1 be the maximum number of elements 2, 22, -*+ , %: € (M) 
which satisfy (4.20). The existence of such an | is clear from the quasi-strong 
complete continuity of the operation 7. For, such 2, 22, --- , 2; are clearly 


mutually linearly independent and the proper space of 7 belonging to the 
proper value 1 is of finite dimension by Lemma 3.2 of §3. 





. Indeed, it will be easily seen that zx A y is the minimum of z and y in the sense of 
lattice, that is, 2 Ay < 2,2 Ay S y, and for any z « (M) with z < 2, z < y, we have 
zS2zAy. 
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We shall prove that this system {r.(#)} (a = 1, 2, --- , ) is just the required 
one. In order to show this, let 2(#) be an arbitrary element of (M) which 
satisfies (4.21). We shall show that x(Z) is a linear combination of 2; , 2, -.. , 
z,. For this purpose, consider the minimum te = 2 A eq Of & and 2, for 
a = 1,2,---,l. By Lemma 4.4, zq is also a solution of (4.17). We shall 
first show that each zx, is a constant multiple of 2. : Le. = Cela. Indeed, if 
this is not true for some a, then zz and zy = 2,/|| x4 || are not equal, and 
consequently (ra — te)4 and (x, — tq), are both ~ 0. Again by Lemma 4.4, 
these are also the solutions of (4.17). Hence, if we put 2a = (@%. — 
| (ta — te)4 ||, tee = (te — La)+/|| (ta — Za) ||, then the system of | +1 
elements 21, 22, +--+ ,2a-1, Lai, La2, Tati, *** » % Clearly satisfies (4.20), and 
this is a contradiction to the definition of J. 

Thus we have proved that each zr, = Z A Za is expressed in the form: 
x, = Cala, Where cq is a real number with 0 S ca S 1. Next we shall prove 
that we have x = ; r= 7 oe Cala. For this purpose, we shall show 
that 2’ = x — , xz. satisfies x’ A ta = O fora = 1, 2,---,l. In order 
to prove this, it is sufficient to show that we have (x — ta) A 2q = O for 
a = 1,2,---,l. This is, however, clear if c. = 1; for, te =X A La = Le 
implies « = 2q (since z(Q) = x,(Q) = 1), and 2’ = x — a, = 0. And, in case 
Ca < 1, this follows from (1 — ca)((t — ta) A ta) S (x — ta) A (1 — €a)ta = 
(c — 24) A (ta — Ly) = O (by (4.19)). Thus x’ A z2 = 0 is proved for a = 
1, 2,---,1l. Consequently, if we have x’ ~ 0, then the system of 1 + 1 ele- 
ments 2’/|| 2’ ||, 21, 22, --- ,2, will again satisfy (4.20), and this is also a 
contradiction. 

Thus we have proved that we have x’ = 0, and consequently z = )-jai24 = 
dp-1Ceta. Since the uniqueness of the expression and the condition 
>°i-1Ca = 1 are both clear, the proof of Lemma 4.5 is hereby completed. 

Coro.tuaRy. {2a(Z)} (a = 1, 2,--- ,1) is a base of all the solutions of the 
proper value equation (4.17); i.e., any x € (M) which satisfies (4.17) is uniquely 
expressed as a linear combination of 21, 2, +--+, 21. 

Proor. Clear from Lemmas 4.4 and 4.5. 

THEOREM 6. P,(t, FE) is expressible in the form: 


l 
(4.23) Pi(t, E) = 2 ye(t)ta(E), 
where {xa(E)} (a = 1, 2,---,1) is the system of completely additive set func- 


tions € (M), which was defined in Lemma 4.5, and {y.(t)} (a = 1,2,--- ,l) isa 
system of bounded Borel measurable functions ¢ (M*), which satisfy 


(4.24) T (Ya) = Ya; Ya(t) Pa 0, > Ya(t) = 1, 
and 


(4.25) [ La(dt)ys(t) = 1 or O according asa = Bora # 8B. 
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Moreover, {ye(t)} (@ = 1, 2,--- , 1) ts a base of all the sobutions of the proper 
value equation in (M*): 

4.26) Ty) = y: vd = | PU duis, 

and any solution of (4.26) with y(t) 2 0 can be expressed uniquely in the form: 
(4.27) y) = D eayelt 

with non-negative constants ca (a = 1, 2,--- ,l). 


Proor. Since Pi(t, Z) is a solution of (4.17) for any ¢t¢€Q, the expression 
(4.23) follows directly from Lemma 4.5. If we take a Borel set E., such that 
ta(E.) = 1 and as(E..) = 0 for any B ¥ a (the existence of such E,, follows 
from the fact that z.(2) = 1 and zg A x. = O for any B = a), then (4.23) 
becomes P,(t, E) = ya(t). Hence each y.(t) is a bounded Borel measurable 
function of t. We shall next prove (4.24). Since the second and the third 
relation of (4.24) are clear from (4.16), we have only to prove the first one. 
From (4.10) of Theorem 5 we have 


[ P@ as)Pus, #) = Prle, B), 
or, by (4.23), 
is ( [ P(t, ds)ya(8))20(E) = p> Ya(t)a(E), 


a=] 


and, if we put H = E’. , then we have 
[PG dadynls) = yol0. 


Thus (4.24) is proved. In order to prove (4.25), start from the relation 
T(ta) = ta. Since T(x) = 2 is equivalent to 7\(x) = x, we have 7T;(r.) = 
ta, Or 


[ za(dt)Pi(t, E) = r2(E), 


and, putting H = Ep, we have the required relation (4.25). 

Thus the first part of the theorem is proved. The second part may be 
proved as follows: Let y(t) be a solution of (4.26). From (4.26) we have 
(exactly as in the preceding case) 

yi) = | Prt, asdy6o), 
and, by (4.23), 


yi) = D veld ( f ra(ds)y(9)) =D cavald 
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with ¢, = / t,(ds)y(s). Since y(s) = Oimplies c. 2 0, the proof of Theorem ¢ 
Q 


is hereby completed. 


§4.4. Ergodic decomposition of (2 under the condition (K). Let us consider 
the sets E, = E,ly.(t) = 1],a = 1,2,---,l. By (4.24), these are mutually 
disjoint Borel sets. 

THErorEM 7. There exists a system of mutually disjoint Borel sets {E,} 
(a = 1, 2,--- ,l), such that 


(4.28) ta(Es3) = 1 or 0 according as a = B or a ¥ 8, 
(4.29) P(t,E.)=1, teEa, 


(4.30) hub. (22 P™¢, zB) — za(E) = 


te EwECL m=1 


where M is a constant which is independent of n. J 

Remark. (4.29) means that, for any a, each point ¢ eH, is transferred by 
the Markoff’s process P(t, Z) inside E,, and (4.30) means that the uniform 
limit 


LS pm. E) = P(t, BE) 


no ae m=1 


is independent of the initial point te#,. Because of these properties, 
E, (a = 1, 2, --- ,1) will be called the ergodic parts of Q. 

Proor or THEOREM 7. (4.28) and (4.29) are the consequences of (4.25) and 
(4.26) respectively. In order to show this, we have only to prove 

Lemma 4.6. If x(E) is a completely additive real-valued set function ¢ (M) such 
that x(Q) = 1 and x(E) = 0 for any Borel set E C Q, and if y(t) is a bounded 
Borel measurable real-valued function «(M*) such that 0 S y(t) S 1 for any 
téeQ, then 


[ z@vo = 


implies 


2(Ey) = 1, where Ey = Ely(t) = 
t 


Proor. If we put 


1 
E,=E\il—-s _ 
| nay <1 n i} 


t 


then we have 9 = E, + >-%_, E, and 
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ve [ x(di)y(t) = i x(di)y() + > [ r(dé)y(0) 


n=l 


< x(Ey) + > (1 - 4) 2@) 
< (Bs) + & 2B.) = 20) = 


Since the equality holds only if we have z(#,) = O for n = 1,2, --- , we have 
x(Ey) = 1. Thus Lemma 4.6 and hereby (4.28) and (4.29) are proved. 
(4.30) is a restatement of the relation (4.9) of Theorem 5 (¢ = 1, Ay = 1): 


l.u.b. 1 , ™ P™ (t, EB) _ Pxlt, E) . = n=l, 2, eee, 


teQ2,ECa | m=1 
if we only observe that we have 
P(t, Z) = x.(E) for teE.. 


As is easily seen, EZ. is not necessarily the set of the smallest measure with 
the property (4.28). Indeed, there might exist a Borel set EF C Eq such that 
mes (HZ) < mes (E.) and x.(E) = 1. If we denote by E.(C E,.) the Borel 
set of the smallest measure among those which satisfy (4.28), then F, is deter- 
mined up to a set of measure zero and E C E,, mes (Z) > O imply z.(E) > 0. 


We shall show that, if we take suitably the sets E. C E. (a = 1, 2,---, J), 
then the following theorem is true: 
THEOREM 8. The Borel sets Ea (a = 1, 2,---,l) and A = Q — a 
satisfy 
(4.31) P(t, EZ.) =1, tee, 
(4.32) P(t, E.) = 1, teE., 
for any teE, and E Cc E., mes (E) > 0 implies 
(4.33) P,(t, E) > 0 and consequently there exists a posi- 
tive integer n = Mie E) such that P(t, E) > 0, 
(4.34) lub. 2 > p™¢, a) = n=1,2,- 
teQ Mm=1 


where M is a constant which is independent of n. 

Remark. (4.31) means that, for any a, each point te FE, is transferred 
finally into EZ. (in the sense of arithmetic mean), and (4.32) means that each 
point te H, is transferred inside E,. Moreover, by (4.33), Ea is indecom- 
posable into two sets with the property (4.32).. In Theorem 12, we shall obtain 
4 more precise result than (4.34): 


l.u.b. PL, A) s ane n= 


with positive constants M rear €. 
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Because of these properties, Ea (a = 1, 2,---, 2) and Q will be called the 
ergodic kernels and the dissipative part of 2 respectively. It is to be noted that 
the ergodic kernels are only determined up to a set of measure zero (even under 
the condition (4.32)), while the ergodic parts E. are strictly determined by 
E, = Ely) = 1). 


Proor or THEorEM 8. Let E2(C E.) be the Borel set of the smallest 
measure among those which satisfy (4.28). E®. clearly satisfies (4.31) and (4.33) 
(since P,(t, Z) = x_(E) for t e E.), but (4.32) is not always automatically satis- 
fied for any te E®.. In order to obtain the required Borel set E., we shall 
construct a sequence of Borel sets E2 D E, D EX --. D EZ D .--- by mathe- 
matical induction. Let #2 be already defined and assume that we have 
ta(E%) = 1. Then we define E2™ as the set of all te £2 which satisfies 
P(t, Ea) = 1. Since 

















[ zelanPG, BD = | raldt)PC, B) = 2—(E2) = 1, 





we have (by Lemma 4.6) z,(E%*’) = 1. If we now consider the set E% = 
a<-1 La, then E, = Eis the required set. For, we have 


i ta(E2) = lim z.(E2) = 1 


‘i 
\ no 







and 






P(t, EZ) = lim P(t, Et) = 1° for te ES. 


n—?>o 







Thus we have proved the existence of the Borel set E, C E. which satisfies 
(4.31), (4.32) and (4.33). Since (4.34) is clear from 






P(t, A) = rs Yalt)te(A4) = 0, te, 





the proof of Theorem 8 is completed. 











$4.5. Proper values of modulus 1 of the operator 7’. 

THEOREM 9. Under the condition (K), each proper value of modulus 1 of the 
operation T is a root of unity. 

Proor. Let A, || = 1, be a proper value of the bounded linear operation 7’, 
which maps (M) into itself. By Theorem 5, \ may also be considered as a 
proper value of the bounded linear operation 7, which maps (M*) into itself. 
Indeed, (since P,(t, £) is not identically zero) if we put z(t) = P,(t, EZ) fora 
suitable Borel set #, then x(t) is a non-trivial solution of the proper value 
equation: 












(4.35) T(x) = dx: d2(t) = [ P(t, ds)x(s). 
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z(t) is clearly a complex-valued bounded Borel measurable function ¢ (M*). 


We shall prove that \ is a root of unity. 
In order to prove this, we shall first show that the real-valued bounded 


measurable function #(t) = | a(t) | « (M*) attains its maximum. From (4.35), 
we have 


z(t) < [ P(t, ds)(s) 


and consequently 


zi) s [ P(t, ds)z(s), n=1,2,.--- 
2 
Taking the arithmetic mean and considering its limit, we have 
2) < | Puls, as)a(s) 
2 


or, by (4.23), 
l l 
#) < Dyed ( [ ra(ds)z(9)) =D tered, 


a=] 


where 2 = [ X_(ds)x(s) is a real non-negative number. Let é be the: maxi- 
2 


mum of & (a = 1, 2,---,l). Since > par Ya(t) = 1 by (4.24), we have 
z(t) < for any te. We shall prove that this ¢ is attained by Z(t) at some 
point f ¢€2. Indeed, by the definition of ~, there exists at least one integer a 
such that & = [ Xa(ds)x(s), and since z.(Z) = 0, r.(2) = 1 and %(s) S é for 


2 
any s¢Q, there must exist, by Lemma 4.6, a point t& «such that Z(t) = & 


(or more precisely, the set Ey = E[Z%(s) = €] satisfies r.(Ho) = 1). 


We have thus proved that there exists a point & «2, such that (to) = 
| (to) | = max | z(t) | = & We shall next prove that the set 
te2 


(4.36)  E(n) = Ex) = dX" 2(b)] 
t 

satisfies 

(4.37) P(t, E(n)) = 1 

forn = 1, 2,---. From (4.35), we have 





A"2(4) = [ P (tp, ds)x(s) 


and, dividing by \"x(t) and taking the real part, 


. x(s) - 
1 = [ P (io, ae (<2),), n =1,2,--- 











216 KOSAKU YOSIDA AND SHIZUO KAKUTANI 


Since P™ (, EZ) = 0, P(t, 2) = 1 and n (2) < 1 for any 8 €Q, we 


have (4.37) by Lemma 4.6. For, we have 


| RZ) io 1] = E (2) = x"2(t)] 


Thus we have proved (4.37) for n = 1, 2,---. From this follows the exist- 


ence of two positive integers m and n, such that E(m)-E(n) # 0. For, if we 
have E(m)-E(n) = 0 for any couple of integers m, n (m ¥ n), then we have 


(4.38) P(t), EQ) = 0 for m #l, 
and consequently, by (4.37), 
Pills, B@) = lim + Y P™(o, B|) = 0. 
Since P,(%, E) is completely additive for Borel sets Z, this implies 
Py(bo , > E(l)).= 0. 
This is, however, a contradiction, since we have, by (4.37) and (4.38), 


P™ (4, SEM) =1, m = 1,2, -+, 
l=1 


and consequently 


Pi (to, 2 BO) = lim * YY P™ (, 3 BO) = 1. 


Thus we have E(m)-E(n) ~ 0 for a certain couple of integers m, n (m # n). 
Consequently we have (by (4.36)) A”"x(to) = A"x(to.)(¥ 0) or AX” " = 1, and 
hereby the proof of Theorem 9 is completed. 


§4.6. Decomposition of each ergodic part (and ergodic kernel) into sub- 
ergodic parts (and subergodic kernels). 

THEOREM 10. Under the condition (K), there exists a positive integer N such 
that P‘"”’(t, E) is decomposed into the form: 


(4.39) P°(t, E) = Prt, E) + S*, B), 
in such a way that we have 
[ PG ants, B) = f PM,a)P%, B) 
2 


(4.40) 
= [ PY (t, ds)Pi(s, E) = Pi (t, B), 
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4.41) [ P*(t, ds)S*(s, E) = [ S*(t, ds)P*(s, E) = 0 


and 

ie (n) M oll 
(4.42) lub. 18 (t,Z)| s ate’ oe i @..«. 
where M and « are positive constants which are independent of n. 

Proor. In Theorem 9, we have seen that each proper value \ of modulus 1 
of 7 is a root of unity. Since T has only a finite number of such proper values, 
there exists a (sufficient large) positive integer N such that \” = 1 for any 
proper value A of modulus 1 of 7. Hence the bounded linear operation 7” 
corresponding to the kernel P™ (t, E) has no proper values of modulus 1 other 
than 1, and Theorem 10 is a direct consequence of Theorem 5. 

Rewanm. It is to be noted that we have, by (4.39) and (4.42), 





(nN) _. p* ae — 
(4.43) Lub. | Pt) - PREIS gaa 1 = 1,2, 


160,ECQ 


with positive constants M and e. 
Moreover, just as in Lemma 4.5, there exists a system of real-valued com- 


pletely additive set functions {x;(E)} (¢ = 1, 2, --- , L) e (M) with the prop- 
erties: 


(4.44) T\(xi) =a, w20, uQ)=1, Act=0 (#)), 
such that each x2*(#) e (M) which satisfies 
(4.45) T'(2*)=2*, z*20, 2*(2) =1 
is uniquely expressed in the form: 
L 
(4.46) 2*(E)=Dicizi(E), 20, De =1 
i=l i=] 
In particular, just as in Theorem 6, P;(t, Z) is expressed in the form: 
L 
(4.47) Prt, E) = De yi Oxi (B), 
where {yz(t)} (i = 1, 2,---,L) is a system of real-valued bounded Borel 
measurable functions e (M*) which satisfy 
L 
(4.48) Pyi=", w¥O20 Lyd= 
w=1 
Let us denote by E7 the set Ey" = 1], 7 = 1, 2, ,L. These will be 


called the subergodic parts of 9. Exactly as in Theorem 7, these are mutually 
disjoint Borel sets, and we have 
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x? (E7) = 1 or 0 according as ¢ = jor 7 ¥ J, 
P(t, EY) =1, teE, 
M 


Lub. | PZ) — 2B) | S$ G@ppp 


teE7, ECL 





with positive constants M and «. More precisely we can prove 

THeoreM 11. The totality of all the subergodic parts Ei Gi = SS eee 
is divided into | classes (Ea, ig => , Be.) (a = 1, 2,--- , Ll), where d, isa 
divisor of N with )o)-1d. = L, in such a way that 
(4.50) P(t, Bai.) =1, te B,, i= 1,2, +++, da(aasi = a), 
M 


+e? n=1,2,.. 


) 


(4.51) Lu.b. | P(t, B) — #2,(E)| < ~ 
teBa ECR (1 
where M and ¢ are positive constants which are independent of n. 

Moreover, all E* , (i = 1, 2,--- , da) belonging to the same class are contained 
in the same ergodic part E.,, , and if we denote by a. ,(E) and yz, (t) the corresponding 
elements of (M) and (M*) respectively (which are obtained by the arguments given 
above), then we have 

* 


(4.52) T(xz,) = Lace; $= 1,2,---, dela =a), 
(4.53) T (yz; +1) = oe = 5, 2, ale Ra da (aa,41 = a), 


(4.54) t.(E) = ; - x*.(B), 


(4.55) ye(t) = > yi (0b. 


Remark 1. (4.50) means that, for any a, each point t ¢ 2/1 x is trans- 
ferred, by the Markoff’s process P(t, £), cyclically in EX, ; B.. ; vee (Mow,3 
and (4.51) means that, in each £2, , P(t, EZ) defines a Markoff’s process, 
whose nt* iterate P‘"“” (t, E) is uniformly convergent to a limit which is inde- 
pendent of the initial point te E, : 

Remark 2. The equality Z, = >0%, £2, is not necessarily true. Indeed, 
D. = E, — Do%: E., is the set of all £€Q such that y.(t) = > es ya, (t) = 1 
and ya; (t) <1 for? = 1,2,---,da. From the proof of (4.51), we see that 

da 
(4.51) lub. |P™¢, BE) — Yyt.@rt,() |< 4 
he : (1 + €)”’ 


teEg, ECQ 


=1,2,++ 


with positive constants M and e. 

Proor or THEOREM 11. We begin with some preliminary considerations. 
Sinee T° 7 (a7)-= TT (x7) = T(a7), T(az) satisfies (4.43) fori = 1, 2,---,L. 
Hence there exists a system of real constants ¢;;, 1,7 = 1, 2, --- , L, such that 
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L L 
(4.56) T (a?) == > Ci; , c; 20, > Cc; = 1. 

i= j= 
Thus 7 may be considered as a linear transformation on the system {x?(E)} 
(i = 1,2,---,L). Consider the matrix C = (c;;), i,j = 1, 2,---,L. This 
matrix clearly satisfies 
(4.57) Cc” = unit matrix. 


We shall show that C is a matrix composed of only 0 and 1, which defines a 
permutation of the indices 1, 2,---,. Indeed, denoting by c{"~” the ij-ele- 
ments of the matrix C”~”, we have Ditici” = 1, Ditaci! Yes = 1 for 
i= 1,2,---,L. Since 0 S cy S 1 for any k and 7, there must exist, for 
each i, an index k; such that c;; = 1. In other words, each column of the 
matrix C must contain 1 at least once. Since >> cx: = 1 for each k, we must 


have k; ¥ k; for «7 # j. Hence, (ki, ke, ---,kz) is a permutation of the 


indices (1, 2,--- ,Z), and consequently we have cq; = 0 fork ¥ k;. 
Thus we have proved that C defines a permutation of the indices 1, 2, --- , L. 
Hence the indices 1, 2, --- , L are divided into l’(S JL) classes, and each class 


is permuted cyclically inside itself by the matrix C. Let these classes be Ka 
(a = 1, 2,--- ,l’) and the number of the indices belonging to Ka be da. By 
(4.57), each d, is a divisor of N. 

For each a, consider the set of all the indices which belong to K.. By a 
suitable numbering a1 , a2, --- , aa, Of these indices, we must have ¢a,a;,, = 1 
and Ca;a; = Oforj ¥i+1(¢ = 1, 2,--- , da; once for all, we put ag4i = ay). 
Consequently, we have (4.52). 

After these preliminaries, we shall proceed to the proof of Theorem 11. We 
shall first prove that we have | = I’ and that there is a one-to-one correspondence 
between the ergodic parts EZ, and the classes K. in sucha way that (4.52), 
(4.53), (4.54) and (4.55) are true. ((4.52) is already proved.) 

For this purpose, recall that each 2,(£) satisfies T(r.) = 2a. Hence x.(F) 
satisfies (4.45), and consequently x,.(F) is uniquely expressed in the form: 


L l= de 
(4.58) 24(Z) = cf 2;(£) = >, ee Cx, ta,(E). 


We shall first prove that there exists a class K,. such that c; = 1/dq for 
ieK, andc; = 0 foriéK.. In the first place, it is clear that c;_is inde- 
pendent of 7 in each class K,. For, since T(ra) = %_ and T(2a;) = Ta;4; 5 
we have, from (4.58), 


r(E) = >> >> ct, 2%,,,(B), 


a=1 i=1 


and this implies cz, = Sales fori = 1, 2,---,d,. In the second place, all 
indices i with ci > 0 belong to the same class K.. For, if this is not the case, 
then 2.(E) will be decomposed into two non-trivial parts x, and x, , which are 
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both invariant under 7: t2 = 2, + ta, Za > 0, 22 >0, te A Ta = = 0, 
T(r.) = x, Tae) = 2a. If we put ra = tq/|| ta || and tae = z'n/\| 2 |! 
then the system of 1 + 1 elements 2%, 22, +--+ ,2a-1, Tai, Ta2, Lati, +++, 2; 
will satisfy the condition (4.20), and this is a contradiction to the definition of 1. 
Thus there must exist a class Ka such that c; = c > 0 fori eK, andc* = () 
for i@éK,. Since it is clear that we have c = 1/da, the relation (4.54) is 
hereby proved. 

Thus to each z,(E) (a = 1, 2, --- , 2) there corresponds a class of indices K,. 
Conversely, as is easily seen, to each class K, there corresponds a completely 


additive set function x,(#) = > >, x3,(E) ( which clearly belongs to the 


system determined in. Lemma : 5) in such a way that the correspondence 
Zq(E) <— K,.is one-to-one. Hence we must have 1 = lI’, and the one-to-one 
correspondence between the ergodic part E, and the class of indices K, is also 
established. 

We shall next prove (4.55). From the relation 


Ld | Pra, dP, B) = Pal, B) 
we have, " (4. wg and (4.47), 
LE LE v.0([ aor, ) = Dvr, 


N m=1 a=l1 i=1 


or, by (4.52), 


Ue > ¥ 2 x%.0(B) = du Ya(t)a(E). 


N m=i a=l1 i=l 


Using the fact that d, is a divisor of N, we have 


> (= v2.00 )( 7 3 :#,)) = Y yell) 2a(B), 


a=1 \i=1 da i=1 


or, by (4.54), 
l 


x (= 2,00) ta(E) = p> Ya(t)t_(E); 


a=1 


and this implies (4.55) (put Z = E,). 
In order to prove (4.53), we start from the trivial relation: 


[ P(t, ds)P}(s, Z) = [ PY (t, ds)P(s, BE). 
2 2 
- “ 45) and (4.52), this becomes 


¥ ( > (f P(t, as)y2.(9)) ta,(E) = . : Ya; (t) (J. ta, (ds)P(s, #)) 


oe ¥ Ya; (t)Xe;4,(E), 


a=1 i=1 


> i=1 
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and, putting E = Bocas , we have 
[ P@ ds)ya441(8) - ya, (t); 


e., (4.53) is proved. 
Thus we have _proved (4.52), (4.58), (4.54) and (4.55). From (4.55), it is 


clear that each Ex, is contained in EF, , and (4.50) is a direct consequence of 


(4.53) (use Lemma 4.6). 
Lastly, we shall prove (5.51). 


by (4.6) and (4.37), 
parte) (¢ Ei) = [ P®”(t, ds)P“(s, E) = [ Pi (t, ds)P“(s, E) 
Q 


For any positive integers n and k we have, 


+ [ S*™(t, ds)P“*(s, E). 


Since we have 


[ Pia, dP (6, £) = [| 22,(a)P™(s, BE) = 22,8) 
Q Q 


for any t e #2, , and since, by (4.40), 


Tf s*(, ds)P™(s, B) S lub. | 8°) | $ go> 





forn = 1,2, --- , we have 


(nN+kdq) me... M 
|P (t, E) Le;(E) | Ss G+e* + .* 


forte B3,,k = 1, 2,---,N/daj;n = 1, 2,---, where M and e« are positive 
constants which are independent of n and k. Hence we have (4.51), by a suit- 


able change of M and e. 


Thus Theorem 11 is completely proved. 
Remark 3. We can also define subergodic kernel EX, ; in each subergodic part 


Ez, jnamely, E%, is the set of the smallest measure which satisfies te,(Es,) = 1. 
If we suitably take EZ, and E%, (which are all determined only up to a set of 


measure zero), then we have 


CoROLLARY. 
(4.59) E. = p> E<., E>, = E. E23, ’ 
(4.60) P(t, Et.) =1. te E2,. 


Proor. (4.59) is clear from (4.54), and (4.60) follows from (4.29), (4.50) 
and the second relation of (4.59). 
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It is to be noted that these Z, and Fl are exactly the final sets (ensembles 
finals) and their cyclic subsets which were discussed by W. Doeblin [1]. 
THEOREM 12. 
M 


lob P"'¢ 2 Ss .——, = wm 
(4.61) u-b (t, )S aye n=1,2, 


where M and « are positive constants which are independent of n. 


. * _ 
Proor. Since each 2x,;(F) satisfies 


xa,(A) < > x* (A) = da-ta(A) = 0, 


i=1 


we have 


Pr(t, A) = > yi (2; (A) = 0 


for any t eQ, and consequently we have, by Theorem 10, 


M 
Pe om Se ee 
aa t, Saye 


Since P“” (t, A) is monotone decreasing in n, we can deduce from this easily the 
relation (4.61) (by a suitable change of M and e). 


r= 


§4.7. Deduction of the condition (K) from the condition (D). 

Lemma 4.7. Let us denote by I(so) the closed interval O S s S 8. Then 
Q(t, s) = P(t, I(s)) is Borel measurable as a function of two variables t and s in 
0OSts<1. 

Proor. By assumption, Q(t, s) is Borel measurable in ¢ if s is fixed, and if t 
is fixed Q(t, s) is monotone increasing in s and is continuous on the right: 


lim Q(t, 8) ma Q(t, 8). 


8—>8ot0 


We shall prove that, for any a, the set E(a) = E [Q(t, s) < al is Borel 
(t,8) 


measurable as a two-dimensional point set. For this purpose, put E,(a) = 
E (Q(t, s) < a]. Since Q(t, s) is monotone increasing in s, we have E,,(a) D 
t 


E,,(a) for s; < s: , and consequently 


E(a) = Dy E,(a) X I(s), 


where X denotes the Cartesian product. Since Q(t, s) is continuous in s on 
the right, the section of E(a) by the straight line t = & is, if not empty, a semi- 
open interval of the form: 0 S s < s. Hence we have 

E@= Bala) xX I(s), 


s=rationa 
se Q 


and this shows that E(a) is Borel measurable as a two-dimensional point set. 
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Lemma 4.8.'° Let K(t, E) and n(t, E) be two kernels with bounded density: 


K(, B) = | Meds, |W, #)| S || Kl 


Nit, E) = [ n(t, s)ds, | n(t, s)| < || NII, 


where k(t, s) and n(t, 8) are both bounded measurable functions defined on 0 &S t, 
s <1. If we consider the corresponding integral operators K and N, which map 
the Banach space (M) into itself, then the integral operator NK defined by the 


1 
kernel / ( l k(t, u)n(u, 8) au) ds is strongly completely continuous as an operator 
E\4 
which maps (M) into itself. 


Proor. For any x(£) « (M), put y = K(x) andz = N(y) = NK(z). The 
set function z(£) is absolutely continuous: 


2(Z) = / 2'(s) ds, 
and its density z’(s) is given by 
z2'(s) = [ y(du)n(u, s) = I (/ x(dt)k(t, u) )n(u, s) du. 


Hence || x || S 1 implies 
lu.b. |2’(s) | S || K||-|| || 
Osessl 


and 
[Ciee+9 —z'(s)|ds S iif, (f |n(u, s + 6) — n(u, s)|du) ds 


if we put n(u, s) = 0 for s < O and s > 1. 
Consequently, we have 
+ 


lim |z’(s + 6) — 2’(s)|ds = 0 

uniformly for all «(#) « (M) with ||z|| < 1. Hence, by a theorem of A. Kol- 
mogoroff [1] and M. Riesz [1], (if we consider z’(s) as an element of the Banach 
space (L)) the totality of all z’(s) corresponding to the unit sphere || z || < 1 
of (M) is strongly compact in (L). In other words, NK is strongly completely 
continuous as an operator which maps (M) into (L). Since (L) is isometric 
to a closed linear sbspace of (M), NK is also strongly completely continuous 
as an operator which maps (M) into itself. Thus the proof of Lemma 4.8 is 
completed. 





' KX. Yosida, Y. Mimura and 8S. Kakutani [1]. 
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Lemma 4.9. If P(t, E) satisfies the condition (D), then we have 


(4.62) P(t, E) = [ q(t, s)ds + R(t, BE), 


where q(t, s) is a bounded (« ‘) Borel measurable function defined for 0 Sit, 
n 


s S 1, and 
(4.63) 0< R(t, FE) <1-—b 
for anyteQandE CQ. 
Proor. Define Q(t, s) newly by 

Q(t, s) = P(t, I(s)). 
By Lemma 4.7, Q(t, s) is Borel measurable as a function of two variables t 
and s. Since Q(t, s) is monotone in s for any ¢, Q(t, s) is almost everywhere 
differentiable in s for any fixed t. If we put 

; 1 
p(t, ») = tim n(@(t,2+2) - 069) 


no 


and define q(t, s) by 


a(t») = min (pt, 9), +), 


then q(t, s) is bounded ( < ‘) and Borel measurable as a function of two variables 
n 


t and s. We shall prove that the kernel R(t, EZ) defined by 
R(t, E) = P(t, B) — | a(t, s)ds 
E 


satisfies (4.63) for any teQ and EF C Q. 
For this purpose, we have only to notice that there exists for any t e 2 a Borel 
set N, of measure zero such that 


P(, B) = | pit, sds + PC, NeB) 
Ez 
for any Borel set H C 2. Then we have 


0< Ri, B= [ (lt, s) — att, 8) ds + P(t, Ni-E) 


=| plys)ds + P, N,-B) 
E°E;(n) 


= P(t, E,(n) + N}), 
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where E,(n) = E EZ s) > 4} Since clearly mes (£;(n)) < 9 for any ¢ €Q, 


we have mes (E;(n) + Nz) < 9 and consequently, by (D),0 Ss R(t, Z) <1 —b 


for any teQand EC Q. 
TueorEM 13. The condition (D) implies the condition (K). 
Proor. By Lemma 4.9, we have 


(4.64) Tl=Q+R, ||R|| S1-4, 
where Q and R are the two integral operators which are defined by the kernels 


Q, £) = [ q(t, s) ds and R(t, E) respectively. If we expand T*” = (Q + R)” 
Ez 


in 2” terms: 
Y hie = Q” -- Q”"R + Q” "RQ + rit +} QR” + . 


then the terms which contain Q at least twice as factor are all strongly com- 
pletely continuous. In order to see this, consider for example the term 
RQRQ””*. By (4.22), QR and Q” * are both integral operators with bounded 
density kernels. Hence, by Lemma 4.8, QRQ”~ and consequently RQRQ”~* 
are strongly completely continuous. Since the number of terms which con- 
tain Q at most once as factor is m + 1, and since the norm of each such term 
is < (1 — b)”” by (4.64), we see that there exists, for each m, a strongly com- 
pletely continuous operator V,, , which maps (M) into itself, such that 


| 72" — Vn || S (m+1)0 — by". 


Since the right hand side converges to zero as m — «, the proof of Theorem 13 


is hereby completed. 
Remark. The converse of Theorem 13 is not true. To see this, take an 


arbitrary point s ¢€Q, and define P(t, Z) by 
P(t, FE) = 1 if se, 
= 0 if Hé E, 


for any teQ. This kernel P(t, HZ) defines a strongly completely continuous 
integral operator, but the condition (D) is clearly not satisfied. 


§4.8. Markoff’s process with a finite number of possible states. Consider a 
Markoff’s process with a finite number (= N) of possible states. Let pi; 
(i,j = 1,2, --- , N) be the transition probability that the i state is transferred 
to the j state after the elapse of a unit-time. Then the transition probability 
pi; that the i state is transferred to the j state after the elapse of n unit- 


times is given recurrently by 


N 
(4.65) pi} = »» Pe Piy, = Dik = Pas 
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and we have always 


N 
(4.66) por 20, Dopp =1, 


j=1 






fori,j7 = 1,2,---,Njn=1,2,---. 

We shall investigate the asymptotic behavior of ps} forlargen. This problem 
was discussed by many authors (see the introduction at the beginning of the 
paper), and sometimes direct methods were successful in this case. We 
shall, however, treat this problem as a special case of our general Markoff’s 
















process. 
Consider the finitely valued function p(t, s) defined in the square 0 < t, 
s <1 by 
| (4.67) p(t,s) = N-p,; for aot git< + it Ss< ‘. 








i,j = 1,2,---,N (incaset = N orj = N, < is to be replaced by S$). Then 





Pit, E) = / p(t, s) ds defines a simple Markoff’s process on the interval 
B 





Q = (0, 1), and we have 





(4.68) P(t, Z) = [ p” (t, s) ds, 





where 






Se ee ee 
got eae , 


WT St<y Gp se<y 
i,j = 1,2,---,N (in casei = N orj = N, < is again to be replaced by 3S). 
Thus the Markoff’s process P = (p,;) (t,j = 1, 2, --- , N) is reduced to the 
continuous case P = P(t, E). Since it is clear that the corresponding integral 
operator 7’ is strongly completely continuous in this case, we have, by the 
results obtained above, 
THEOREM 14. (i) The limit 








(4.69) p™(t,s) = N-ps? for 















(4.70) lim 1 > ps = p? 


no 2 m=1 
exists for any i and j, and there exists a constant M such that 


1 . m C-) * . 
(4.71) 5 oe PP — DP a=, t,j =1,2,---,N;n =1,2,--:. 





(ii) There exists a system of mutually disjoint ergodic parts EB. (a = 1, 2, --- ,!) 
such that 


(4.72) DL pr =1 if ieBa, 


i¢Bq 
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(4.73) ps? is independent of i in each E, . 

(iii) Each ergodic part E., contains an ergodic kernel E, such that 
(4.74) p> po =1 if ies, 
(4.75) p> pi=1 if ieEe. 

(iv) A= 2 — >-i-1 Eq is called the dissipative part of 2 and we have 

M 5 
(m) Spe estos. Fae [= eee . = eee 

(4.76) Pii = a rN €)” jeA,2 1, 2, ; N; n 1, 2, 


with positive constants M and «. 
(v) To each ergodic part E., there corresponds a positive integer dq such that 


Mea M 
(4.77) | pit? — dap?| < 





ieE.,j =1,2,---,Nj;n =1,2,--- 


(1+ 6)” 

with positive constants M and «. Moreover, each E,, contains d, (mutually dis- 

joint) subergodic parts EX ' } od ee Bea, such that 

(4.78) > pr=1 ff ieF%,, k=1,2,---,da (aa,41 = a). 
je Ba,,, 

(vi) If we further put E>, = E,-E>, for any a and k, then we have 

(4.79) > pi=1 ff ieE,, k=1,2,---,da (aaj = a1), 

ve Bays 


Added in proof. Recently, N. Dunford and B. J. Pettis [1] obtained some 
new results concerning weakly completely continuous operators defined on the 
space (L). Among others, they proved that if K and N are weakly completely 
continous operators which map (LZ) into itself, then NK is strongly completely 
continuous. This result is more precise than Lemma 4.8. 
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ON TYPES OF “WEAK” CONVERGENCE IN LINEAR NORMED 
SPACES 


By Inco Mappavs, Jr. 
(July 28, 1939) 


Introduction. In his paper “On a generalized notion of convergence in a 
Banach space,” B. Vulich (Vulich 1, pp.’ 156-174) has pointed out that it is 
often possible to introduce a metric into a given concrete space with a given 
notion of convergence of a sequence of elements such that convergence in 
accordance with the metric is equivalent to the given convergence notion. How- 
ever, it is not always possible to do this in terms of a metric which is a distance 
function (Vulich 1, p. 163). Consequently, Vulich has introduced a generalized 
metric by associating with each finite set or complex of a given linear set’ a 
non-negative real number or norm. A set of six axioms imposed on this gen- 
eralized metric and a limit notion defined in terms of it give rise to the concept 
of K-normed spaces.” The K-normed spaces are found to be special cases of 
Banach spaces (Banach, p. 53), and the K-convergence of a sequence of elements 
to a limiting element is found to imply Banach convergence to the same element. 
That is to say, K-convergence of a sequence to an element is stronger than 
Banach convergence of the sequence to the same element. 

In this paper, also, the notion of the norm of a finite complex is employed. 
However, this norm is subjected to only three of the six axioms used in the 
definition of the K-normed spaces of Vulich. A limit notion for sequences is 
defined in terms of this norm. Among other things it is shown that point-wise 
convergence of a sequence of continuous functions to a continuous function on 





1 By a linear set will be meant the usual one given in Banach’s book on page 26. 
* A linear set X is said to be K-normed if there is associated with each finite set or 


complex of its elements a non-negative real number or norm, written || (1 , 22, --- , Zn) ||, 
which satisfies the following axioms: 

Axiom A. If x; = 2;, then || (m1, +--+, 2i, °°: %j,*+*,2n) |] = || (Qi, +++, Zen, 
Titty *%* y Lp, eee » En) lI ” || (a1, ttt y Bi, cee, Vji-t, Viz1, *** In) ||. 

Axiom B. || (x) || = 0 implies z = 6, where @ is the null element of the linear set. 

Axiom C, || (x1 9 *** yn, zi ae » %) || Ss || (x1, hgh » Tn) || + || (x; ' » &) ||. 

Axiom D. || (a1 ee In) I| s II (1, cer Ins En+1) I|. 

Axiom E. || (a1 + af, «++, an + 2%) || S [| (i, ++, am) I+ Il at, oes» ee) I 

Axiom F. || (az, --- , ata) || = | a@|-|| (ai, -*+ , n) |I- 

By 2n —» x it is meant that given any e > 0 there exists an N(e) such that n 2 N(e) 
implies || (tn — 2, +++ , 2nyp ~ 2) || < efor all p => 0. The sequence {z,} is said to be 
K-convergent if given any e > 0 there exists an N(e) such that if n, m = N(e) then 
|| (tn — tm *** , Lnxp — Lm) || < e for all p = O. 


A linear set is said to be a K-normed space if it is K-normed and if the limit notions 
are given by the definitions of the previous paragraph. 
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the finite closed interval reduces to the convergence of the sequence to the same 
function in terms of the norm of this paper. In all concrete cases considered it 
is seen that convergence of a sequence to an element in accordance with the 
limit notion to be defined is weaker than Banach convergence of the sequence 
to the element. Linear functionals and operations which are continuous in 


terms of this limit notion are studied. 


1. Axioms and fundamental notions. Consider a linear set X of elements z 
such that there is associated with each finite subset or complex of elements of X 
a non-negative real number, called the norm of the complex and written 


|| (a1, t,---, an) || for the elements 2, 22,---,2,. This norm will satisfy 
the following axioms: 

Axiom A. Ifz; = 2;, then || (m,---, 2, +--+ ,2%j,-++,%n) || = || (ai,---, 
Vi-1 » Vig y+ y Vj, ee? » Tn) | = | (m1, ct > Dey ees y Uj-1, Ljg1,-°* 5 Xn) ||. 

Axiom B. || (x) || = O implies x = 6, where @ is the null element of the linear 
set X. 

Axiom 3. If a, a2,---,4, are real constants, then |] (ai7, dere, ---, 
Ann) || S max | a; |-|| (v1, 22, +++ , Xn) |]. 

lsisn 


A linear set with the norm of a finite complex defined and satisfying Axioms 
A, B, and 3 will be said to be H-normed. 

DEFINITION 1. A sequence {z,} of elements of an H-normed set X will be 
said to be convergent in the H-sense, or to be H-convergent, if given any e > 0 
and any infinite subsequence {2,,} of {z,} there exists a Ko = K(e, {x,,}) such 
that k, s 2 Ko imply 


(1) | (Tn, Says tll oO Bg < e. 


DEFINITION 2. A sequence {z,} of elements of an H-normed set X will be 
said to be convergent to the element x « X in the H-sense, or to have 2 as its 
H-limit, written x, —» x or H-lim x, = 2, if given any e > 0 and any infinite 


nN=00 
subsequence {2z,,} of {x,} there exists a Ko = K(e, {xn,}) such that k 2 Ko 
implies 


(2) | (Zn, — 2, -++ tm — 2) | < e. 


An H-normed set for which the limit notions are given by Definitions 1 and 2 
will be referred to as an H-normed space. 

It is evident that in an H-normed space if a sequence converges or converges 
to an element x then every subsequence does likewise. 

Remark 1.1. It should be noted that our Axioms, A and B are the Axioms 
A and B of K-normed spaces and Axiom 3 is a consequence of the axioms of 
K-normed spaces (Vulich 2, p. 61). Axioms A, B, and 3 are independent 
(Vulich 2, p. 60), and the examples which will be given in §2-5 will illustrate 
that there are H-normed spaces which are not K-normed according to the 
same norm. 
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The set of all real numbers forms an H-normed space if || (21 , 22, --+ , Zn) ||= 
inf (| 1 |, | v2 |, --- , | %n |), where by “inf” is meant the smallest of the numbers 
involved. If in this same set z, —, x means that given any e > 0 there exists 
an N(e) such that n 2 N(e) implies that |z, — z| < e, then we have 

TueoreM 1.1. In the set of all real numbers x, —x x is equivalent to 2, —» 2. 

Proor. For any infinite subsequence {2,,} of {2p}, || (fa, — Z,°*, 
tm, — &) || = inf (| a, — 7|,---,|%y, —2|) Slay —2|. If zt. —s 2 then 
this last expression approaches zero ask — «©. To prove the converse assume 
that 2, a 2 does not imply z, sz. Then there exists an infinite subsequence 
(rn,} Of {z,} and a positive number X such that | z,, — z| > A > 0 fork = 
1,2,---. ThenO <A < || (ta, — 2, ---, 2% — 2) || for every integral k. 
This contradicts the assumption that 7, —,z 2. 

Remark 1.2. In general any linear set for which an F-metric (Banach, p. 35) 
is defined is an H-normed space if || (a1, --- , %n) || = inf [(a1, 0), --- , (tn, 8)). 
By Theorem 1.1 x, —x Z is equivalent to z, — 2 in accordance with the F-metric. 
Hence, any linear normed space’ is also an H-normed space, and when the norm 
of a finite complex is defined as in this paragraph norm convergence of a sequence 
to an element is equivalent to H-convergence to the same element. Under the 
same circumstances norm convergence of a sequence is equivalent to H-con- 
vergence of the sequence. 

The following theorems relate to H-normed spaces. 

THEOREM 1.2. If 2, yx and x, = 2, for every n, then x, —y &. 

THEOREM 1.3. If Xn x 2, then for any x’ ¢€X,2%,+ 2’ ~x rt 2’. 

RemakK 1.3. In Remark 3.1 an example will be given which will show that 
In a x and y, —x y do not imply that rz, + Yn x © + Y. 

Because of Theorem 1.3 it is evident that the topology of an H-normed 
space is a uniform topology. 


THEOREM 1.4. || (az, --- ,a@%n) || = |a|-|| (a1, --+ ,2n) |]. 
Proor. || (am, ---, azn) || S|a|-|| (a, ---,2n) || = |a|-|| (aa nm, ---, 
aa 'zy) || S |a|-|@"|-|| (am, --- , ax) || = || (am, --- , az,) ||. This com- 


pletes the proof. 

TurorEM 1.5. If {an} is a sequence of real numbers such that |a,| S M, 
where M is independent of n, and if Xn x 0, then Antn x 8. 

Proor. Choose any infinite subsequence {@n,2n,} Of {ant,}. Then || (@n,2n,, 
“++ Qn) || SM || (tn, ,---,2n,) ||. Since 2, x 0 it follows that the right 
hand side of this inequality approaches zero as k > ©. 

Corotuary 1.51. If {an} is a sequence of real numbers such that a, —» 0 and 
Uf tn Hy 0, then Antn —y 8. 

Corotiary 1.52. If ais areal number and x, x 2, then ax, xn ax. 





* A linear normed space is a linear set X with the property that there is associated with 
each element z a non-negative real number or norm, designated by || z ||, which is such 
that (1) || # || = 0 implies z = @, (2) ||z+y || S$ llz ll + lly ll @) ll@ll = [elle 
foreachrealé. A sequence {z,} of X is said to be convergent to z e X in the norm or Banach 
sense if [| an _ z||-0Oasn— oo, 
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2. The space (CH). Consider the set of all continuous functions defined on 
the closed interval (0, 1), and let the norm of a finite complex of elements be 
given by || (v1, --- ,2n) || = max inf (| a(t) |,---,|2n(t)|). With this norm 

0<t<1 


the set becomes an H-normed space and it will be designated by (CH). 

TuroreM 2.1. In the space (CH) a necessary and sufficient condition that 
In —n x ts that x,(t) — x(t) for each t € (0, 1). 

Proor. Because of Theorem 1.3 it suffices to prove this theorem for the 
case Xn —n 8. eae 

Necessity. Assume on the contrary that for some fo , lim | tn(éo) | = 2A > 0. 
Then there is a subsequence {z,,} of {z,} with the property that | 2,,(é) | > 
for each integral k. Therefore, || (%n,, +++, nm) || 2 inf (| tn,(to) |, ---, 
| an,(to) |) > A > O for each integral k. Consequently, 6 is not an H-limit of 
{z,}, in contradiction to the assumption of the theorem. 

Sufficiency. Assume that there is a subsequence of {z,}, which will also be 
designated by {z,}, such that || (a1, --- , 2») || is greater than a positive number 
\ for n = 1, 2,---. Designate inf (| x(t) |,---,|an(t) |) by fa(é). Fora 
fixed value of n it is easily shown that f,(¢) is a continuous function on the closed 
interval (0, 1). Hence, there exists a ¢, such that f,(t,) = max lf.(t) | = 

sts 


|| (a1, +++ ,2n) ||. By the Weierstrass-Bolzano Theorem it is possible to choose 
a ieee {tn,} of {tn} and a point & such that on tn, = bo. 


Each fr(to) 2 A. For if fnr(to) < » there would siniat an interval J about & 
such that if te J then f,-(t) < ». Now choose n, > n’ and such that t,, €J; 
then fn,(tn,) <fn'(tn,) < , which is a contradiction. But f,(t) S | x,(t) | for 
all x and ¢, so | ta(to) | > fn(to) 2 A, in contradiction to the assumption that 
Xn(t) — O for all t € (0, 1). 

Remark 2.1. If 2,22, ---,2%, and 21, 2%, ---, x, are two finite complexes 
of an H-normed space it does not follow that || (a1 + 21, a + 22,-::; 
tn + xn) || S || (tr, 22, ---,2n) || + |] (vi, 22, ---, 2) ||. For consider the 
following complexes from the H-normed space (CH): x = 2, = 5and 2’ = 4, 
x’ = 2. In this case the inequality sign is actually reversed. If the sign of % 
is changed then the inequality sign is again reversed. Therefore, unlike the 
norm of the K- normed spaces of B. Vulich, the norm of a finite complex of a 
H-normed space does not satisfy a “triangle property,” nor does it satisfy a 
“triangle property” with the sign reversed. 

REMARK 2.2. In the space (CH) it can readily be shown by methods similar to 
those employed in the proof of Theorem 2.1 that a necessary and sufficient condi- 
tion for the convergence of a sequence in the H-sense is: | 2,(t) — 2m(t) | > 0 
for each te (0, 1). There are numerous familiar examples from the space (CH) 
which illustrate that H-convergence of a sequence does not imply that the 
sequence has an H-limit which is of the space. That is to say, an H-normed 
space is not necessarily complete with respect to the H-convergence notions. 

Remark 2.3. The norm of a finite number of elements of an H-normed space 
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is not necessarily a continuous function of its arguments; i.e., r{ >, 2‘ for 

: : : (1) (p) (1) (p) 

i = 1, 2,---, p does not imply lim || (z,°, --- , 2,”’) || = || (x pose eB yi. 
n=00 


For consider the following sequence from (CH): 
x(t) = z,(0) = O for 2/n St S 1, 
£n(1/n) = n, 


and In(t) linear elsewhere (n = 1, 2, $a % ) (Caratheodory, p. 171). This se- 
quence is such that 2, —z 6. Let a second sequence be given by 2) (t) = 1 for 
each n and for te (0, 1). Then t_ —21. Moreover, Il (te, 22) “eT 
each n, whereas || (6, 1) || = 0. 


3. The space (L’H),p = 1. The norm of a finite complex which was used in 
the previous paragraph to make the set of all continuous functions on the finite 
closed interval an H-normed space was obtained by operating on the greatest 
lower bound of the absolute values of the elements involved with the norm 
which is generally employed to make the set of all continuous functions a 
Banach space (Banach, p. 11). To define || (a, --- ,2,) || in an analogous 
manner for the set of all functions defined on the closed interval (0, 1) and 


1 
whose pth powers are summable set it equal to ‘ [ finf (| a(é) |, ---, 
0 


1/p 
| an(t) |)]” i} , where by “‘inf’’ is meant the greatest lower bound of the func- 


tions involved at every point of (0, 1). With this interpretation of the norm 
of a finite complex the set of functions considered is an H-normed space. This 
space will be referred to as the space (L’H), p 2 1. It will be seen from what 
follows that the H-convergence of a sequence of elements of (L’H), p = 1, to 
an element of the space is extremely weak. 

Lema 3.1. If {f,} 1s a sequence of summable functions defined on (0, 1) and 
such that 

(1) f.(t) = O for each n, 

(2) {fn (O} is non-increasing, 

(3) lim | f,(t) dt = 0, 

n=o JO 

then lim f,(t) = 0 except on a set of zero measure. 


n=0 


THEOREM 3.1. In (L?H), p = 1, tn xn x is equivalent to either of the following 


statements: 
(1) For any subsequence {2n,} of {xn}, inf | x,,(t) — x(t) | = 0 except on a set 
k 


of measure zero which depends on the subsequence. 





* All the work of this section would go through without difficulty if the ‘‘inf”’ should be 
defined as the greatest lower bound of the functions involved except on a set of zero 
measure. 
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(2) For any subsequence {Xn,} of {<n} there is a set E of zero measure such 
that for ty ¢E there is a subsequence { Ing, of {tn,} such that lim Tnp, (to) = x(t). 


Proor. Because of Theorem 1.3 it is sufficient to prove this theorem for the 
case Z, —» 6. We first show that 2, —» @ implies condition (2). Let fi(t) = 
finf (| x(t) |, --- , | an(t) |)]’ for each te (0,1). If s is constant then by Lemma 
3.1 lim f2(t) = 0 except on a set E, of zero measure. Let E = paw J: ; then 


n=O 


E is of zero measure and it is the set mentioned in the statement of the theorem. 
For consider any t of the complement of £; lim fz (to) = 0 for each s, and in 


particular for s) = 1. Hence, there exists an integer 8; > 8 such that 
| 2.,(to)| <1. Consider {f5!*"}; lim f3?""(to.) = 0, and so there exists an integer 


82 > 8 such that | z,,(é) | < 3. In this way an increasing sequence {s;} of 
integers can be built up in such a way that | 24;(to.) | <1/7. Then lim 2,,(%) = 0. 


Therefore, E is the set mentioned in statement (2). Since z,, —x 0 is a conse- 
quence of 2, — 6, and since we may apply the above argument to {z,,}, our 
contention is proved. 

It is obvious that statement (2) implies statement (1). 

Now suppose that (1) holds when z = 6. Let {2,,} be a subsequence of 
{zn} and let f.(t) = [inf (| 2n,(0 |, ---, | 2,(é |)J’. Then (1) states that 
inf | t,(4) | = lim [f.(t)]"” = 0 except on aset E of zero measure. Hence, 


1 
lim f.(t) = 0 except on E and lim [ fi(t) dt = 0. Therefore, tz, 2x 0. 
kesco k=oo JQ 


ReMARK 3.1. The following example from (L’H) shows that in an H-normed 
space the H-limit of a sequence is not necessarily unique. Let {z,} be defined 
by zn(t) = 3(1+-7,(2)), where r,(t) = sgn (cos 2rnt).? If E, = E(z,(t) = 1), 
then {H,} is a sequence of sets of closed intervals with no points common to 
any pair. This sequence has the property that the measure of the points 
common to each E,, of any subsequence {E,,} is zero. Consequently, each 


{%n,} of {2,} has the property that lim [ [inf (| 2n,(é) |, --- , | tn, (é) |)? dt = 0, 
=o V0 


and so z, —z 6. Now consider the sequence {z, — 1}. By a study of the 
complements of the E,, sets—the sets on which z, — 1 = —1—it is readily 
seen that z, — 1 —, 6. By Theorem 1.3 it follows that 2, —, 1. Hence 
the sequence {z,} has the H-limits @ and 1. 

Consider the sequence {z,} just defined together with the sequence {z,}, 
where x, = 1—2,. Since {z),} has the H-limits 6 and 1 it follows that H-lim 


t, + H-lim x, may equal either 6 or 1. But H-lim (x, + z,) = 1, so in an 


n=00 


H-normed space x, — x and y, —x y do not imply (2, + yn) Sax & + Y. 





5 The r, functions are closely related to the well known Rademacher functions. 
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Remark. 3.2. By giving different interpretations to || (x, --- , 2a) || in the 
set of all functions defined on (0, 1) and whose pth powers are summable the 
H-limit notion will be found to have different meanings. For example, in the 
case of all summable functions defined on (0, 1) let || (v1, --- , ,) || = max inf 


0stsl 
( [ x(t) dt 


| t 
. aay | [ Xn(t) dt ). The set of all summable functions on (0, 1) 
0 
is an H-normed space according to this norm and it will be designated by 





t 
(LH),. Since [ x(t) dt is a continuous function of ¢ on (0, 1) it is evident from 
0 


t t 
Theorem 2.1 that z, —y» x is now equivalent to lim {| 2z,(t) dt = [ x(t) dt for 
every t € (0, 1). 


4. The space (cH). Consider the set of all convergent sequences {£;} of real 
numbers. Let an element 2, of the set be designated by {£,:} and let || (x, 
.-+,2n) |] = sup | a? |, where by “sup” is meant the least upper bound of the 

lsigeo 


numbers involved, and a; = inf (| &, |, --- , | a: |). Then the set isan H- 
normed space and it will be designated by (cH). 
THEoREM 4.1. A necessary and sufficient condition that x, —» x in (cH) is 


that 


(a) lim fs: =& for i=1,2,---, 
(b) lim lim é,; = lim é;. 
n=co | i=00 too 


Proor. This theorem can be derived from Theorem 2.1 since (cH) can be 
put into a 1-1 norm preserving correspondence with a subset of (CH). The 
correspondence is {£;} to f(t) where f(1/z) = & , f(0) = lim &; and f(é) is linear 

#=00 


elsewhere.° 


5. The space (l?H), p = 1. Consider the set of all sequences {&;} of real 
numbers such that } 7-1 | & |" < «©. Let any element x, be designated by {én}, 
and let || (a1, ---, %») || = [Qo@a | a? |?]'”, where a? = inf (| fs |, ---, | és |). 
Then the set is an H-normed space and it will be designated by (’H), p 2 1. 

THEorEM 5.1. A necessary and sufficient condition that x, 4 x in (lH) is 
that Tim fas = &: fort = 1,2,---. 


Proor. This theorem can be derived from statement (1) of Theorem 3.1 
since (1’H), p = 1, can be put into a 1-1 norm preserving correspondence with 
a subset of (L?H), p = 1. The correspondence is {£;} to f(t), where f(t) = 
i(i + 1)& for 1/(1 + 7) < t < 1/iand f(t) = 0 elsewhere. 





* The proofs of Theorems 4.1 and 5.1 were suggested by the referee. They are used here 
because they are much simpler than those originally given by the author. 
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6. Weak sequential convergence. In the last four sections certain concrete 
sets of elements have been considered as H-normed spaces by giving definite 
meanings to the norms of finite subsets of elements. Each of these sets may 
also be considered as a Banach space by proper definition of the norm for single 


elements. Banach has given necessary and sufficient conditions for weak 
sequential convergence’ in each case considered (Banach, pp. 134-137). An 
examination of our Theorems 2.1, 3.1, 4.1, and 5.1 will readily show that the 
necessary and sufficient conditions for z, —x x are in each case weaker than the 
necessary and sufficient conditions for weak sequential convergence. That is 
to say, %, —x x for the sets considered has been made weaker than weak se- 
quential convergence by ‘the interpretations given the norms of finite com- 
plexes. In this section different interpretations of the norms of finite complexes 
on the same sets considered in the previous sections will be given, and the 
resulting H-convergence notions will be found to be equivalent to weak se- 
quential convergence. 

To this end it is advisable to prove first a general theorem on Banach spaces 
and then to give appropriate interpretations to the norms of finite complexes in 
the different cases to be considered. Let EF be a Banach space and let E be 
its conjugate, or the space of all linear limited functionals defined on F (Banach, 
p. 54). If {f,} is a sequence of and f is a single linear limited functional defined 
on E, then f, — f will always be taken to mean f,(x) — f(x) for every x ¢ E; 
i.e., fn > f will mean that {f,} converges weakly to f (Banach, p. 122). Bounded 
sets of E will be weakly compact if every sequence chosen from a bounded (in 
the norm sense) set of linear limited functionals defined on E is such that there 
is a subsequence {f,,} of {f,} and another linear limited functional fo defined 
on E such that f,, — fo. It should be noted that if f, — f and ||f|| < M, 
where M is independent of n, then || f || < lim || f, || < M (Banach, p. 123). 


In the Banach space £ let || (71, --- , tn) || = max G,(f), where f is a linear 
If 1 |=1 
limited functional on EF and where G,(f) = inf (| f(a) |, --- , |f(@n) |). Galf) 


is a real valued function defined on FE. Because of the introduction of this norm 
for a finite complex the elements which form the Banach space E also form 
an H-normed space (EH). In this section the notions of H-convergence will 
be in terms of this norm. 

Lemma 6.1. If f. — f then G,(f.) — G,(f). 

Proor. This lemma follows readily from the fact that each G,(f) involves 
only a finite number of elements of # and from the definition of f, — f. 

Let F be the subset of F such that if fe F then ||f|| = 1. Let Fo be the 
closed extension of F in the sense that if f, «FF and f, > f then f C Fo. 

Lemma 6.2. If the Banach space E is such that in E bounded sets are weakly 


compact, then max (,,(f) is attained by some element fo C Fo. 
HF 1|=1 





7 The sequence {z,} of a space X is said to be convergent to an element x X in the 
weak sequential manner if for every linear (homogeneous and additive) and continuous 
functional f(x) defined on X, lim f(z,) = f(x) (Banach, p. 133). 

n=00 
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Consider any G,,(f) and a sequence {f,} of elements of F, and let 


PROOF. 
lim Gag (fs) = max Ga (J)- Each G,,,(f.) is actually equal to the absolute value 
of f, at some point of #; for by definition each G,,(f,) involves only a 
finite number of elements of E. Therefore, G,,(f.) = | f.(a:) |, where 7 assumes 
one of the values 1, 2,---, mo. Since the sequence {f,} is infinite there is 
some one 2;, and an infinite subsequence {f.,} of {f.} such that G,,(f.,) = 
| fe,(ig) | for each integral p. Since each || f., || = 1 there is a subsequence 


Ife,.} OF tfsp)} and an fo, of unit norm or less, such that ie — fo. By Lemma 
r] 
6.1 it follows that lim Gn, fuy,) = G,,(fo). This completes the proof. 
7 =00 


TuroreM 6.1. If E is a Banach space such that in E bounded sets are weakly 
compact then x, — wx ts equivalent to the weak sequential convergence of {x,} to x. 

Proor. This theorem is a consequence of Lemmas 6.1 and 6.2 and its proof 
follows almost word by word the proof of Theorem 2.1. 

It is well-known that the sets of elements which form the spaces (CH), (L’H), 
(’H), (p 2 1), and (cH) are also elements of separable Banach spaces if the 
norm for single elements is properly chosen. Banach states (Banach, p. 123) 
that each separable Banach space £ has a conjugate space whose bounded sets 
are weakly compact as sets of functionals over E. Consequently, if the norm 
of a finite complex is defined as in this section for any one of the concrete Banach 
spaces just mentioned, then x, —x x is equivalent to the weak sequential con- 


vergence of {z,} to x. 


7. Comparison of limit notions. It is of interest to note how the gap between 
the H-convergence in H-normed spaces and the K-convergence in K-normed 
spaces may be bridged. It is first necessary to introduce a new limit notion. 

Derinirion 1°. A sequence {2,} of elements of an H-normed set X will be 
said to be convergent in the G-sense, or to be G-convergent, if given any e > 0 
and any infinite subsequence {z,,} of {z,} there exists a Ko = K(e, {xn,}) such 
that for k = Ko there exists a Pp = P(e, k, {n,}) such that k => Ko, s 2 Ko 
and p = Pp» imply 


(1°) | (Tm — Tngy*** 9 Unptyp — In,) | < é 


Derinirion 2°. A sequence {z,} of elements of an H-normed set X will be 
said to be convergent to the element x e X in the G-sense, or to have = as its 
G-limit, written 2, >¢ x or G-lim z, = 2, if given any e > 0 and any infinite 


n=0 


subsequence {2n,} of {z,} there exists a Ko = K(e, {n,}) such that fork = Ko 
there exists a Po = P(e, k, {xn,}) such that k = Ky and p = Po imply 


(2°) || (tm, — 2, +**, Ima, — 2) || <e. 


ToErorEM 7.1. In a K-normed set t, —x x is equivalent to Xn —@ 2. 
Proor. If 2, ¢ x then given any e > 0 there exists an N(e) and if n = N(e) 
there exists a P(e, n) such that n = N(e) and p = P(e, n) imply 
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\| (ta —2, +++, 2nyp—2)||<e. Butforall0SiSp,!|(ta—2, +--+, &n4s—2)|| s 
|| (tn — 2, +++, Lntp — &) || < e, because of Axiom D of K-normed sets. Hence, 
the required inequality holds for every 7 = 0, and so 2, —x 2. 

To prove the converse note that z, —x x implies z,; x x for every sub- 
sequence {2,,} of {z,}. From this fact it is immediately evident that x, >, x 
implies 2, —¢ 2. 

THEeorEeM 7.2. Jf in a K-normed set a sequence {x,} is K-convergent then it 
is G-convergent, and conversely. 

In the proofs of Theorems 2.1, 3.1, 4.1, 5.1, and 6.1 we made use of a property 
which when generalized to cover all cases may be written as 

Property 1. || (a1, ---, 2m, ng1) || S || (G1, --+, Ln) ||. 

This property is actually Axiom D of K-normed sets with the inequality sign 
reversed. Since the norms of the spaces (CH), (L’H), (l’H), (p = 1), (cH) 
and (EH) satisfy Property 1 it is evident that they are not K-normed spaces. 

THEOREM 7.3. In an H-normed space whose norm satisfies Property 1 1, —, x 
is equivalent to Xn 6 2. 

Proor. The proof of this theorem depends on Property 1 in much the 
same manner that the proof of Theorem depends on Axiom D. 

THEOREM 7.4. In an H-normed space whose norm satisfies Property 1 if a 
sequence ts G-convergent then it 1s H-convergent, and conversely. 

Remark 7.1. Consequently, the G-convergence notions may be used in 
place of the H-convergence notions and the K-convergence notions in the 
mutually exclusive classes of H-normed sets satisfying Property 1 and of 
K-normed sets, respectively. 


8. H*-normed spaces. By an H*-normed space will be meant an H- normed 
space whose elements also form a linear normed space and in which convergence 
of a sequence {z,} to x in the norm sense (Banach convergence), i.e., || 22 — 
x || — 0—written x, +, z—implies z, —y x. If an H-normed space satisfies 
Property 1 and if it is linear normed wit! respect to the norm of a finite complex 
when that norm operates on single elemeats then it is evidently an H*-normed 
space. The K-normed spaces are not rm=cessarily H-normed spaces (Vulich 1, 
p. 163) for the K-limit notion is in general stronger than the limit notion in 
the norm sense. 

Remark 8.1. The example of Remar 3.2 is an H-normed space satisfying 
Property 1. Moreover, the set of elements of that example form a linear 


t 
normed space when the norm || z || = max [ z(u) du| is used, and so it is 
ost<1 | Jo 


an H*-normed space. However, this is not the usual norm employed to make 
the set of summable functions on (0, 1) a linear normed space—the usual norm 


1 
is || z|| = [ | z(u) |du. Nevertheless, it is evident that the space (LH): is 


an H*-normed space when x, —z 2 is in accordance with the second norm just 
given and when x, —y 2 is with respect to the norm of a finite complex of (LH): . 








~~” HAN 


)~ 
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By an H-normed space will be meant an H*-normed space in which the H-limit 
is unique and satisfies the condition: 2, —” x and Yn —a y imply (zt, + Yn) x 


(x + y)- 
Remark 8.2. The space (L’H), p = 1, is an H*-normed space which is not 


H-normed. 
Lemma 8.1. If {an} ts @ sequence of real numbers such that a, —», a and if x 
isan element of an H-normed space whose norm satisfies Property 1, then an% —y az. 
TuroreM 8.1. If {x,} is a sequence of elements of an H-normed space X whose 
norm satisfies Property 1, if Xn — x where x « X, and if {a,} is a sequence of real 


numbers such that a, x a, then QnXn —x a2. 

PROOF. Gn%n — AX = (Gn%n — AnX) + (Gnu — ax). From Theorem 1.5 it 
follows that the first expression on the right approaches @ in the H-sense, and 
from Lemma 8.1 it follows that the second expression does likewise. The 
theorem follows from the fact that the space is H-normed. 

Remark 8.3. An H-normed space whose norm satisfies Property 1 does not 
have the property that the derived class is closed; that is to say, if 22 4 Zn 
forn = 1, 2,--- , and if 2, —, 2, it does not follow that there exists a sub- 
sequence {2,"}, where m, < m, < --- < _m, < ---, such that z3* yz. For 
consider the following example from the space of all continuous functions defined 
on (0, 1) with the norm defined as in §6. It has been shown in §6 that in this 
space Z, —y XZ is equivalent to the weak sequential convergence of {z,} to 2; 
L€., Zn a & is equivalent to x,(t) — x(t) for each ¢ e (0, 1) and to the existence 
of a constant M > 0 such that max | 2n(t) | = M (Banach, p. 134). Let {2} 

sts 


be defined as follows: 

tn(t) = x,(0) = O for 2/n St S 1, 

tn(1/n) = 1, 
z,(t) linear elsewhere. This sequence obviously has @ as its weak sequential 
limit. Let z™ be such that 

rn(t) = 2,(t) forO S t S 2/n, 

rn(t) = O for 2/n + 2/m St $1, 

rn(2/n + 1/m) = n, 
rn(t) linear elsewhere. For each fixed value of n, z™ — 2» in the weak sequential 
sense. But since weak sequential convergence implies boundedness of the norms 


of elements it follows that it is impossible to find a sequence of integers m < 
My < +++ < my, < --- such that 7™" —, 0. 


9. Linear operations. Let X and Y be two sets of elements. If to each 
te X there is made to correspond an element y ¢ Y then an operation y = U(z) 
is defined. X will be called the domain and Y the range of the operation. 

Derinition 3. If y = U(z) is an operation which transforms X into all or a 
part of Y, then if 
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(1) X and Y are H-normed spaces y = U(z) is said to be HH-continuous at 
the point 20 if 2, x 2 implies U(x.) 2 U (20), 
(2) X is an H-normed space and Y is a linear normed space y = U(z) is said 
to be HB-continuous at the point 2 if t, —x 2 implies U(x) >, U(x), 
(3) X is a linear normed space and Y is an H-normed space y = U(z) is said 
to be BH-continuous at the point 2 if x, —s % implies U(x,) >» U (2), 
(4) X and ¥ are linear normed spaces y = U(z) is said to be BB-continuous at 
the point 2 if x, x 2 implies U(r,) >» U(ao). 

If in any one of the above cases the specified continuity condition holds for 
every point of Y then the operation is said to be continuous in that sense on X. 

TuroreM 9.1. If X is an H*-normed space, Y is an H-normed space, and 
y = U(x) ts HH-continuous, then it ts BH-continuous. 

ProoF. 2, — 2X implies 2, —, x in an H*-normed space, so by the HH-con- 
tinuity of y = U(x), 2, > 2 x implies U(r) x U(2). 

TuroreM 9.2. If X is a linear normed space, Y is an H*-normed space, and 
y = U(z) is BB-continuous, then it is BH-continuous. 

THrorEM 9.3. If X is an H-normed space, Y is an H*-normed space, and 
y = U(x) ts HB-continuous then it is HH-continuous. 

THreoreM 9.4. If the additive operation y = U(x) transforms an H-normed 
space X into a space Y of the same type and ts HH-continuous at a single point, 
then it ts HH-continuous on X. 

THEOREM 9.5. Statements similar to that of the previous theorem hold when 
y = U(«) is HB-, BH-, or BB-continuous at a single point, X is H-normed, linear 
normed, or linear normed, respectively, and Y ts linear normed, H-normed, or linear 
normed, respectively. 

THEoreM 9.6. If y = U(x) ts an HH-continuous and additive operation from 
an H-normed space X to an H-normed space Y, then it is homogeneous. 

Proor. The proof of this theorem depends on the uniqueness of the H-limit 
notion in H-normed spaces. 

THEOREM 9.7. Statements similar to that of the previous theorem hold when 
y = U(2) is additive and HB-, BH-, or BB-continuous, X is H-normed, linear 
normed, or linear normed, respectively, and Y is linear normed, H-normed, or linear 
normed, respectively. 

In case the operation is from a domain X to the set of all real numbers, which 
set is readily seen to be an H-normed space when the norm of §1 is used, it will 
be called a functional and will be designated by y = f(x). Since in the H-normed 
space of real numbers (see §1) the H- and B-convergence notions are equivalent 
(see Theorem 1.1), the four types of operations of Definition 3 reduce to two 
in the case of functionals. These will be referred to as the H- and B-continuous 

functionals. 


10. General forms of H-continuous and linear functionals. Remark 10.1. 
Let X be an H*-normed space. By Theorem 9.1 every H-continuous functional 
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on X is B-continuous on X. Consequently, the general form of the H-continu- 
ous and linear functional on X must be of the same nature as but less general 
than the general form of the B-continuous and linear functional on X. 

10.1. The space (cH). The elements of this space form a Banach space (c) 
when the norm of a finite complex operates as a norm for single elements. Then 
(cH) is an H*-normed space. The general form of the B-continuous and linear 
functional for the space (c) is f(z) = C lim &; + > C;&; (Banach, p. 66). 

TurorEM 10.1. The general form of the H-continuous and linear functional on 
(cH) is f(z) = C lim & + doh Cik: « 

Proor. By Remark 10.1 every H-continuous and linear functional on (cH) 
is of the form of the B-continuous and linear functional on the same elements 
taken as a Banach space but with additional restrictions on the constant C;,’s. 
Assume that f(x) is H-continuous and linear and that there is an infinite number 
of non-zero constant C;’s involved in its representation. Consider the sequence 
(x,} of (cH) where &.; = 0 if nm ¥ 7 and gj, = sgn 1/C,. This sequence is 
obviously such that 2, —, 6. But for an infinite number of values of n, f(z,) = 
1, so lim f(t,) = 1. Consequently, x, —z 6 does not imply f(z,) 20. There- 


n= 


fore, the possibility of the general form of the H-continuous and linear func- 
tional on (cH) involving more than a finite number of non-zero constants has 
been eliminated. Consequently, we are confined to a consideration of the 
form C lim & + ne C;é;. This functional is readily seen to be H-continuous 


and linear. 
10.2. The space (lH), p 2 1. Remarks similar to those preceding Theorem 


10.1 apply here. 

TueoreM 10.2. The general form of the H-continuous and linear functional 
on (?H), p = 1, is f(x) = Doha Ciéi 

Proor. The proof of this theorem is similar to the proof of Theorem 10.1. 

Remark 10.2. If K is a finite complex then by U(K) is meant the complex 
of the images of the elements of K by y = U(z) (Vulich 1, p. 165). Vulich has 
proved the following theorem for operations defined on K-formed spaces: A 
necessary and sufficient condition that the additive operation y = U(x) be KK- 
continuous is that there exist a constant C > 0 such that for every complex K 
of the domain || U(K) || S$ C || K ||. A similar statement cannot be made for 
HH-continuous and additive operations on H*-normed spaces. For in the 
space (lH), p = 1, let K be the complex (2; , x2) where &: = 0 if p ¥ 7 and 
Son = 1, and let f(x) = & + &. Then || (a1, 22) || = O and || f(a), f(a) || = 1. 
Hence, there exists no constant C > 0 such that || f(K) || S$ C || K ||. 
10.3. The space (CH). ‘The elements of (CH) form a Banach space (C) when 
the norm of a finite complex operates on single elements only. Then (CH) is 
an H*-normed space. The general form of the B-continuous and linear func- 
tional defined on the elements of (CH) when they are taken as the Banach space 
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1 
(C) is f(x) = [ z(t) dg(t), where g(t) is a function of bounded variation and 
0 


z(t) is of (C) (Banach, p. 61). 
THroreM 10.3. The general form of the H-continuous and linear functional 
1 


on (CH) is f(z) = [ x(t) dg(t), where g(t) is a function of bounded variation which 
0 


is constant except for a finite number of discontinuities. An alternate form is 
f(z) = Dota Car(s). 

Proor. It will be shown first that g(t), which must be of bounded variation 
because of Remark 10.1, cannot be continuous and non-decreasing (or non- 
increasing) on (0, 1) with g(t) > g(0) (or g(t) < g(0)) fort > 0. Suppose that 
g(t) is of this form. Define a sequence {z,} of continuous functions as follows: 


z,(t) = z,(0) = O for 2/n St S 1, 
ljn 

z(1/a) = I, whee T= [ t dg(t),® 
0 


and z,(t) linear elsewhere. Obviously, z, —z 9. It will be shown that lim 


n=o 
2/n 


1 1 1/n 
[ z,(t)dg > 0. Write [ x,(t) dg = [ Xn(t) dg + / Xn(t) dg. On the 
0 0 0 1/n 


1/n 1/n 
interval (0, 1/n), xe(¢) = ntl. Then [ z(t) dg = [ nltdg = nI~l = 
0 0 
n. On the other hand, on the interval (1/n, 2/n), z,(t) = —nI~'(t — 2/n), 
2/n i 2/n 2/n 
whence / tn(t)dg = —nr | (t — 2/n) dg = —nr™ [ tdg + 20" 
1fn 1/n 1/n 


2/n 2/n 2/n 


dg. Since in (1/n, 2/n), t S 2/n, it follows that i tdg S2/n : dg, 
1 1/n 


1/n /n 
2/n 2/n 


2/n 
and so i t,(t)dg = —2I™ dg + 2I dg = 0. Consequently, 
1/n 1/n 1/n 
[ z,(t) dg = n, and therefore, if g(t) is of the form specified at the beginning 
0 


1 
of this proof, x(t) is of (CH), and f(x) = [ x(t) dg(t), then there is a sequence 
0 


{zn} such that z, —, @ but for which {f(z,)} does not approach zero as n — . 
Then f(x) is not H-continuous. So, in general if f(x) is H-continuous and linear 


1 
it may not be of the form | x(t) dg(t), where g(t) is continuous and such that 
0 


for some interval (t , t’), g(t.) < g(t) (or g(t.) > g(t)) for t > t. Since if this 
were the case the sequence {z,} used above could be constructed on the in- 
terval (é, ¢’) and then a contradiction would follow. 

Therefore, g(t) must be constant except for at most a denumerable number 
of points of discontinuity of the first kind. Assume that g(é) is of this nature 





§ By the integration by parts theorem for Stieltjes integrals it is readily shown that the 
denominator does not vanish. 
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and that f(z) is an H-continuous and linear functional on (CH). Let {a,} bea 
sequence of points of (0, 1) which approaches some a» from the right (or left) 
and which has the property that at each a, the function g(t) has a discontinuity 
of measure b, ~ 0. Each a, may itself be the limit point of points of discon- 
tinuity of g(#). Let such points be designated by a,» and let the measure of 
the discontinuity of g(t) at np be bap # 0. About each a, choose an interval 
], such that the left-hand end point of J, is to the right of a>, such that the 
length of I, < min (length J,1, 1/n), and such that if a,,¢I, then }°5-1 | bap| < 
|b, |/2”. Define a sequence of continuous functions as follows: let z,(a,) = 
sgn 1/b, ; on the complement of I n let 2,(t) = 0, and finally let z,(¢) be linear 


elsewhere. Then z, —y @. [ 2r(t)dg = bo [g(@np + 0) — glanp — 
0)]zn(Qnp) + Ontn(An) = > Dnptn(Gnp) + 1. If each bapta(Qnp) < 0, then 
[= do = 1 — D2 [bap ||ta(dny) | 21 — LoZ-a | bap |/2" |b] > 1/2. If 


1 
each bapTn(Anp) is not less than zero then [ z,(t) dg is obviously greater than 
‘) 


1/2. Therefore, although z, —x @ it does not follow that f(z,) -: 0. Hence, 
if f(z) is to be H-continuous and linear it is impossible that g(t) be other than 
constant except for at most a finite number of points of discontinuity of the first 


kind. In this case f(x) = [ x(t)dg = >of x(t)[g(ti + 0) — g(ti — 0)) = 


yo Cix(t,). Since only a finite number of terms are involved in this form f(x) 
is obviously H-continuous and linear. 

Remark 10.3. It is evident from this theorem that there are B-continuous 
and linear functionals which are not H-continuous. Therefore, the converse of 
Theorem 9.1 is not true. 

10.4. The space (L’H), p 2 1. Tueorem 10.4. If y = U(x) is a continuous 
transformation from (L?H), p = 1, toa space Y in which the limit notion is unique, 
then y = U(x) is a constant. 

Proor. Let xe (L?H), p = 1, and let y,(t) = x(t)-1/2(1 + r(t)), where 
r,(t) is the function defined in Remark 3.1. As in Remark 3.1 it is seen that 
Yn nu 0 and y, >” x. Since y = U(z) is continuous it follows that U(yn) — 
U(6) and U(y,) — U(x). Because of the uniqueness of the limit notion in 
Y, U(x) = U(@). 

Corotuary 10.41. If y = f(x) is an H-continuous and linear functional from 
(LH), p = 1, then f(x) = 0. | | 

11. General forms of different types of linear operations. If for the set of all 
continuous functions defined on (0, 1) the norm for single elements is taken to 
be sank | x(t) |, then the set is a Banach space and it will be denoted by (C). 


With respect to this norm the space (CH) is an H*-normed space. It has been 





* This theorem and its proof were supplied by the referee. It is a generalization of a 
theorem originally given by the author. 7 
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shown (Fichtenholz, p. 32) that the general form of the BB-continuous and 
linear operation from (C) to (C) is given by 


(1) y(s) = U(x) = [ a(t) d, K(s, 2), 


where K(s, ¢) is a function of two variables defined on 0 S s, ¢ S 1 and is such 


that 


(1°) K(s, t) is of bounded variation as a function of ¢ for each s and var 
0st 


K(s, t) S.C, where C is a positive constant independent of s, 

(2°) K(s, 0) = 0; K(s, 1) is continuous in s, 

(3°) K(s, t) is continuous in measure” with respect to s in the interval (0, 1) 
for each s = &. 

If y(s) = U(x) is BH-continuous and linear from (CH) to (CH), then y(s) = 
U(x) is a B-continuous and linear functional defined on (CH) and it may be 


1 
represented by y(so) = U(x) = [ x(t) d,K(so , t), and so y(s) = U(z) is of form 
0 


(1) with restrictions on K(s, ¢) which will now be examined. 

THEOREM 11.1. The class of all BH-continuous and linear operations from 
(CH) to (CH) is equivalent to the class of all BB-continuous and linear operations 
from (CH) to (CH). 

Proor. Because of Theorem 9.2 it is evident that the class of all BB-con- 
tinuous and linear operations from (CH) to (CH) is a subclass of the class of 
all BH-continuous and linear operations between the same spaces. Therefore, 
since each BH-continuous and linear operation is of the form (1), in order that 
an operation be BH-continuous and linear without being BB-continuous and 
linear it is necessary that some one of the conditions 1°, 2°, 3° on K(s, ¢) be 
weakened. But these conditions are necessary and sufficient not only for the 
BB-continuity and linearity of y = U(x) but also for the continuity of y(s) = 
U(x). Hence, if one of these conditions is weakened y(s) will no longer be 
continuous. Therefore, every BH-continuous and linear operation must satisfy 
conditions 1°, 2°, and 3° and our theorem is proved. 

THEOREM 11.2. The general form of the HH-continuous and linear operation 
from (CH) to (CH) is given by (1) where K(s, t) satisfies conditions 1°, 2°, 3° and 
also 

(4°) for s constant K (s, t) is constant in t except for at most a finite number of 
points of discontinuity of the first kind. 

Proor. If y(s) = U(z) is of form (1) and if K(s, é) satisfies conditions rf, 
then y(s) is continuous because of conditions 1°-3°. Moreover, if x, x 2, then 
Yn(s) — y(s) for each s € (0, 1) and so y, zy. For consider any 8 = 8, then 


1 
| yn(So) — y(8)| S | [ [tn(t) — x(t)] d,K(so, t)|. The last expression is the 





” K(s, t) is said to be convergent in measure with respect to s fors = s9on0 <1 <1 
if K(s, t) as a function of t approaches K(so, t) in measure when s — 80. 
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absolute value of an H-continuous and linear functional (see Theorem 10.4 and 
condition 4°), and therefore, 2, 4 x implies yn(s) — y(so). This proves our 
contention. 

If y(s) = U(x) is an HH-continuous and linear operation from (CH) to (CH), 
then it is of form (1) and K(s, ¢) satisfies 1°-4°. For consider any s = 8, then 
y(%) = C (2) is merely an H-continuous and linear functional on (CH). Hence, 


a(t) d,.K(s0 , t) and K(so , ¢) must satisfy 4° because of the restriction 
i 


1 
of Theorem 10.4. Then y(s) = [ x(t) d,.K(s, t). Since y(s) = U(x) is HH- 
0 


continuous and linear and (CH) is an H*-normed space, it follows from Theorem 
9.1 that y(s) is BH-continuous and linear. By Theorem 11.1, y(s) = U(x) is 
also BB-continuous and linear and so K(s, ¢) must satisfy conditions 1°-3°. 

THEOREM 11.3. The general form of the HB-continuous and linear operation 
from (CH) to (CH) is given by (1) where K(s, t) satisfies conditions 1°-3° and also 

(5°) there exists at most a finite number of points t; , te, --- , tg such that for any 
value 8 = 8 the points of discontinuity of K(8 , t) are to be found among the points 
i(i = 1, 2,---, q). 

Proor. As in the proof of Theorem 11.2 it is readily seen that any H B-con- 
tinuous and linear operation is of the form (1). If y = U(x) is HB-continuous 
and linear, then by Theorem 9.3 it is HH-continuous and linear, and so by 
Theorem 11.2 K(s, t) satisfies conditions 1°-3°. Suppose that 5° is not satisfied 
by K(s, t). Then there exists a sequence {s,} of points of the interval (0, 1) 
such that the points of discontinuity of the first kind which are contributed by 
{K(s, , t)} are infinite in number, that is to say, there is an infinite sequence 
{,} of points such that each is a point of discontinuity of some K(s, , t). Choose 
a subsequence {t,,} of {t} such that {t,} approaches some & from the right 
(or left). About each ¢,, as center and associated with each K(s, , ¢) there is 
an interval J(t,, , 8.) such that within it there is no point of discontinuity of 
K(s, , t) except perhaps 1,, itself. The existence of this interval is assured by 
the HH-continuity of each HB-continuous operation. Let J,, be an interval 
I(t, , 8n) Which does not contain t and whose right-hand end point is to the 
left of t,, + (t,, — t)/2 and which is such that the measure a;, of the discontinuity 
of K(s,, t) at t,, is not equal to zero. Define a sequence {z,} of continuous 
functions as follows: 


y(80) = 


In(t,,) = sgn 1/ay,, 


rn(t) = 0 for ¢t not of J,, , 


1 
and x,(t) linear elsewhere. Then 2, —x 6. Moreover, y,(s) = [ z,(t) d,K(s, t) 
0 


is such that | yn(Sn) | = | ta(t,,) |-| K(8e, &, + 0) — K(s., 6, — 0) | = | sgn 
I/a;, |-|a;,| = 1. Therefore, || y, || = max | yn(s) | = 1, and so 2, —% 6 does 
0<s<1 


not imply y, —, 6. This is a contradiction and so 5° must hold. 
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Let y(s) = U(x) be of the form (1) and be defined on (CH), and let K(s, t) 
satisfy conditions 1°, 2°, 3°, and 5°. Because of conditions 1°-3°, y(s) is con- 


1 
tinuous. Moreover, y(s) = | z(t) d.K(s, t) = >i a(t;)[K(s, t; + 0) — K(s, 
0 


t; — 0)] = >02.1 2(t,)Ci(s) by condition 5°. From 1° we know that there exists 
a constant C > 0 such that | C.(s) | S C for each 7 and for se (0,1). Then 
| yn(s) — y(s)| SC D-2.4 | ea(ts) — x(ti) |. If 22 x 2, then since only a finite 
number of points are involved in the last summation, || y, — y || — 0. 
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CHARACTERIZATION OF COMPLEX COUPLE SPACES 


By ArIsTOTLE D. MicoaL aNnD Max WyMAN 
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Introduction. In considering generalisations of classical complex analysis for 
complex Banach spaces it seems desirable to consider complex couple spaces. 
For such spaces many of the classical theorems can be shown to hold." The 
following paper attempts to characterize such spaces. This is done by means 


of a conjugate operation. 


1. Characterization of Hermitian couple spaces. 

DEFINITION 1.1. Two complex Banach spaces B; , Bz shall be called equiva- 
lent’ if there exists a single valued function f(z) on B,; to Bz with the properties 

(a) f(z) is linear® in z, and is homogeneous of degree one, 

(b) f(z) ranges over the whole of Bz , and is solvable in z, 

(c) [| #2) Il = lle ll. 

If conditions (a), (b) are satisfied we say B,, Be are linearly homeomorphic. 

Let E be a real Banach space with a real inner product [z, y]' From E we 
can construct a complex Banach space E(c) as follows. Let E(c) be the set of 
all couples {z, y} where x, y are elements of E. We define 


(1) {t1, Wi} = {ae, yo} if and only if N=, = ys; 

(2) ltr, ys} + tte, Yo} = ter + te, 41 + ya}, 

(3) (a + ib) {x,y} = {ax — by, bx + ay} where a, bare real numbers, 
(4) iz, ll = (lel? + Uy IP 


DeFIniTIon 1.2. A complex Banach space B is called a Hermitian couple 
space, if it is equivalent to a space of type E(c). 

DEFINITION 1.3. Let B be a complex Banach space. A bilinear function 
(Z, U] on B* to the complex numbers is called a Hermitian inner product’ if 


(i) [Z, U] = [U, Z], [(@ + %b)Z, U] = (a + %b)[Z, UI, 








' For work in general analysis see Taylor (3), Michal and Martin (4), and Michal, Davis 
and Wyman (6). 

* For the notion of equivalence of real Banach spaces see Banach (1), p. 180. 
_ + We use the term linear to mean additive and continuous. In real Banach spaces this 
1s enough to ensure homogeneity of degree one. This is no longer so in complex Banach 
spaces. 
‘ For postulates of a real inner product see Michal (5). In addition we make the restric- 
tion that || a || = [z, a}. 

* These are of course the same postulates given for an inner product in Hilbert space. 
See Stone (2) and the references to Von Neumann’s papers given there. 
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(ii) [Z, Z] = O and [Z, Z] = Oif and only if Z = 0, 
(iii) || Z || = (2, ZF. 

THEOREM 1. A necessary and sufficient condition that an arbitrary complex 
Banach space B be a Hermitian couple space is that 
(a) B must possess a Hermitian inner product [Z, U} 
(8) there exist a function Z° on B to B such that 


(8:1) [Z1, Z2] = (Z1, Zel, 
(G2) Z = Z. 


Proof of necessity. For a space E(c) of couples z = {2, y} the existence of 
a Hermitian inner product’ [z, u] was shown in Michal, Davis and Wyman (6). 
The function Z for such spaces is the ‘‘ordinary complex” conjugate Z = {z, —.y}. 
Let Z = f(z) be the mapping function which makes B equivalent to E(c). We 
define 
(a) [Z, U] =[f-'(Z), f ‘(U)], where the inner product on the right is that of 
E(c), 

(b) Z = fi). 

By straightforward calculation [Z, U] is a Hermitian inner product for B, 
and condition (8) is satisfied. To prove the sufficiency we make use of the 
following lemmas. 

Lemma 1. Let B be a complex Banach space satisfying (a), (8) of theorem 1. 
Then Z is linear in Z, and 
(1.1) (a + 1b)Z = (a — ib)Z, for a, b real. 

For all Z; , Zz , Z3 we have 


(1.2) [Z; + Ze ’ Z3] = [Z, + Z2 ) Zs] ad [Z, ) Zs] + [Z- ’ Zs] = [Zi + Z2 ’ Z3]. 





Thus Z,; + Z: = Z, + Z:, and Z is additive in Z. From || Z || = || Z || we can 
conclude that Z is linear in Z. Similarly property (1.1) is shown. 

Lemma 2. The totality E of all elements W for which W = W is a real sub- 
Banach space of B and is such that || W || is generated by a real symmetric inner 


product. 
Clearly under the operations of B, E is a real normed linear space. If {Wi} 


is a Cauchy convergent sequence of E there is an element W of B such that 
|W —W,l| < 5. From ||Z|| = || Z|| we obtain || W — W, || < 5, and 


hence ||W — W|| <e. Thus W = W and Wis an element of Z. Thus £ is 
a complete space, and hence is a real Banach space. 





° We have used the notation Z asa function on B to B. This should not be confused 
with the complex conjugate notation of ordinary complex numbers. See (81) where both 


occur. 

7 If x, y, ¢, n are elements of FE, and z = {z, y}, u = {f, n} arein E(c), then the Hermitian 
inner product [z, u] of E(c) is given by [z, u] = [z, ¢] + [y, 7] + iffy, ¢] — [2, a]}, where 
{z, y] is the inner product of EZ. 
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For any Wi, W2 of E we have 
(1.3) (Wi, W,] = [W, W)] = (Wi, W2] = [W2, Wi). 


This completes the proof of lemma 2. 
From the £ of lemma 2, let us construct the Hermitian couple space E(c). 
[Z+Z iZ — iZ\. 
For any element Z of B, 1 3? zr | is an element of E(c). Then B is 





equivalent to E(c) and the required mapping function is 
_f{Z+Z iZ -iZ\ 


The additivity and homogeneity of degree one of f{(Z) are easy to verify. From 
\| Z ||’ = [Z, Z] we readily obtain that 
iZ |\|"\3 
ry = ZI 


zy) = (WEEE We = 








4 


Finally the equation f(Z) = {W1, W2} has the unique solution Z = W, + iW, 
for any W,, W: of #. Thus B is a Hermitian couple space. 


2. Characterization of complex couple spaces. If EH is an arbitrary Banach 
space, the couple space E(c) is a complex Banach space under definitions (1)—(4) 
if and only if the norm of E is generated by a real inner product. In a previous 
paper (Michal, Davis and Wyman (6)) the authors raised the question of giving 
a definition to replace (4) which would eliminate the restriction on the norm of 
E. A definition was communicated to us by E. W. Paxson which could be used 
if E were separable. Later on A. E. Taylor’ gave us verbally a definition which , 
places no restriction on EZ. This result is stated in the following theorem. 

THEOREM 2. Let E be an arbitrary real Banach space, and let x, yeE. The 
totality E’(c) of couples {x, y} form a complex Banach space under definitions (1), 
(2), (3) and 


(¥’) l| fz, y} || = sup | (P(x) + P(y))’.° 


The truth of this theorem is easily verified. Complex spaces which are linearly 
homeomorphic to spaces of the type E’(c) are called complex couple spaces. 

THEOREM 3. A necessary and sufficient condition that an arbitrary complex 
Banach space B be a complex couple space is that there exist a function Z on B to 
B with the properties: 


I At+Z=44+h, 
* Both Taylor and Paxson have seen the manuscript of Michal, Davis and Wyman (6). 
Taylor had also seen Paxson’s result. 
. U(x) means a linear functional on E to the real numbers. The superior is taken over 
all linear functionals whose modulus is one. 
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Il iZ = —1Z, 
Il Z = Z, 
IV |Z || = || Z|}. 


The proof follows essentially the same lines as the proof of theorem 1. Z 
will now trivially satisfy lemma 1, and the first part of lemma 2 will be satisfied. 
From E we construct E’(c) and the latter will be linearly homeomorphic to B. 
The continuity of the mapping function f(Z) = 2 us - = ” 4 follows 





from || f(Z) || S V2 || Z|. 
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SEQUENCES DEFINED BY NON-LINEAR ALGEBRAIC DIFFERENCE 
EQUATIONS’ 


By Oris E. LaANcastER 
(Received March 7, 1938) 


An algebraic difference equation 
(1) &(y(x + m), y(x + — 1), a y(z), x) ies 0, 


6a polynomial with rational coefficients in its arguments z, y(x), y(z + 1), ---, 
y(t + m), defines a sequence” 


y(a), y(a+1),---,y(a +m — 1), y(a +m), --- 


when the values of y(x) are assigned at the m pointsa,a+1,---,a+m-— 1. 
Obviously every equation (1) defines an infinite number of sequences: one for 
each set of values y(a), y(a + 1), --- , y(a +m — 1). 

We propose to obtain some of the properties of such sequences by studying 
the difference equations which define them. In the study of infinite sequences 
one is mainly interested in their ultimate behavior. Hence we are here in- 
terested in the behavior of the solutions of the difference equations, for integral 
values of z, in the neighborhood of infinity. Although, at present, it is not 
possible to write out explicitly the solutions of most non-linear difference equa- 
tions, it seems that it might be feasible to determine whether a given equation 
defines sequences that approach zero as x becomes infinite by considering the 
solutions of the linear difference equation formed by omitting the non-linear 
terms. (We assume that at least one linear term appears.) For when y(z) 
approaches zero the linear terms are infinitesimals of the first order, while the 
non-linear terms are infinitesimals of higher order. Therefore, it would be 
expected that the behavior of such sequences is largely determined by the linear 
terms of the difference equation. With this idea in mind, we attempt to gain 
information about the sequences defined by a non-linear difference equation by 
considering the solution of the difference equation formed from its linear terms.’ 

It is clear that once we have criteria for determining when a difference equa- 
tion defines sequences which approach zero we can easily determine whether 
one defines sequences that approach a constant limita. For, if after the applica- 
tion of the transformation y(z) = g(x) + a, the transformed difference equa- 





1 The author’s thanks are due to Professor G. D. Birkhoff for his counsel during the 
preparation of this paper. 

* When the equation (1) is non-linear in the variable y(z + m) some convention must 
be made as to the one of the values of y(z + m) that is to be taken at each step. 

* The investigations of Lattés (1), Horn (2) and Perron (3) have also had as their basis 
the same idea. 
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tion defines sequences that approach zero, then the original equation defines 
sequences that approach a. 

Besides determining conditions sufficient to insure that the sequences of a 
difference equation converge to a definite limit, we study the effect of the degree 
of the equation, of the order of the equation, and of variable coefficients upon 
the rate of convergence of the sequences. Lastly we determine difference equa- 
tions whose rational sequences converge most rapidly to irrational numbers of 
the form 8”, where p is an integer and 8 is a rational number. 

For the discussion we write (1) in the form 


(2) P(Srim ? Snim-1 yom Ce Sn ’ n) Ts 0, 


where S,4; denotes the value of y(n + 7) when is an integral value of x. The 
sequences are then denoted by {S,}. Of course, when we speak of the solutions 
of the equation (2) it is understood that n is a continuous real variable, and 


y(n) = Sn. 


1. Sequences that approach zero. 

Derinition. The phrase “the difference equation of the linear terms’’ shall 
denote the difference equation formed from the given equation by omitting all 
terms that are of higher than the first degree in the arguments S,, Snii,---, 
Snim, Where it is understood that the linear equation is to be of the same order 
as that of the non-linear equation. 

For example, the difference equation of the linear terms of the non-linear 


equation, 


Sn42SnSnyi + NSrSn+2 + Sn + NSnvt + n - 0 


0- Srs2 + NSn41 + Sn nit —n’. 
The characteristic equation of this linear equation is 
0-p + 1p +0 =0. 


It follows from the above definition that besides the usual possibilities for 
infinite and zero roots of the characteristic equation, the characteristic equation 
of the equation of the linear terms always has an infinite root if Sn4m does not 
appear linearly in the non-linear equation and a zero root if S, does not appear 
linearly. 

THEorEM 1. If in a non-linear algebraic difference equation a coefficient of a 
linear term is of as high degree in n as the coefficients of the non-linear terms, and 
uf there are solutions of the difference equation of linear terms which approach zero 
as n becomes infinite, and if the characteristic equation of the difference equation 
of linear terms has finite, non-zero roots not equal to one in absolute value, then there 
are sequences defined by the given equation that approach zero as n becomes posi- 
tively infinite. 
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The truth of this theorem follows immediately from the following theorem 
of Horn”. “In the system of non-linear difference equations 
(3) yx + 1) G,(z, yi(z), al” Ym(2)), (¢ = 1,2,.-- »m) 


let G; be a function of 2, yi(x), --+ , ¥m(%), which vanishes at the point z = «, 


y= 0, +++, Ym = 0 and is analytic in a neighborhood of it: 


G; (2, y:(2), +, Ym(2)) = 2+ aay +++ + GimYm +++ (i = 1, +--+, m). 


Ifs — a,-++,8 — Gm are the elementary divisors of the determinant 
| ai; — 864; |, 

let the system be transformed to the system 
(4) yaa + 1) = ays(x) + fi(1/z, xr(2), «++ , Ym(2)) (¢ = 1,---, m) 
where 
Fi(/ar, yr y +++ y Ym) = Dy ASR ryeeeeed (1/2) Ya? ++ + yr 

(X= 1, =A = +e) =Am =O: A$EM+-+) + Am = 2) 
and 


WA 
eS 
3 


O0<|a|Sl[a|S---Sla|<1<lay|s--- 
If it is possible to choose h so small and 2 so large that 
| fi(1/z, yr, +++ Ym) | < hg 
and 
[FL /a, yay ++ 5 Ym) — FL /ay yay s+, Ym) | 
(5) <g(lyi—-yl|t-:-+ lyn — ym|)forz>m,|ys| Sh, |yi| Sh 
(¢ = 1,2,---,m), 
where g S 3 asP(1 — Jac), @ = 1,2,---,u)3 @ S$ las Mal — 1) 
(@=y+1,---,m),g < g/mand if 9; S h/2, then there exists a unique solu- 


tion’ y;(z), (¢ = 1,---,m) of the system for which | y;| < A for x > 2, 
yi(to) = ni (¢ = 1, 2,--- , mw) and lim y,(x) = 0.”° 


zo 





‘In order to obtain a unique solution Horn admitted only constants for the arbitrary 
periodic functions of period one. The theorem holds when any periodic functions are 
admitted if the »; are defined over the interval (xo , to + 1). 

* Although the theorem above is ‘‘translated’’ directly from Horn’s paper (2), we see 
that if two or more of the a’s are equal then the system (3) cannot in general be brought 
to the form (4). Since, however, an obvious extension of his work shows that the theorem 
holds for the case of equal a’s, we assume the theorem for the general case. If some of 
the a’s are equal the equations (4) are of the form 


yi(x + 1) = aiYyi + ky:-1(x) + f(l/z, Yry*** ys Ym) 


when Qi = ij. 
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We prove Theorem | by showing that the conditions of Horn’s theorem are 
all fulfilled. Since, by hypothesis, a solution of the equation of the linear 


terms approaches zero as n > ©, ®(Snim,-+-+, Sn, ) vanishes at the point 
n= 0, S, = 0,---,Snim = 0. Hence (2) may be written in the form 


OmSn+m + Retetent + Ny + Sn 


(6) -—] 
+ i (Sn4m , Sn4m—1 > oy Sn ) n) + ain 


where ®, is a polynomial whose terms are of higher than first degree in the 
arguments 1/n, S,,---,Sn4m. It follows from the assumption that the roots 
of the characteristic equation of the equation of the linear terms are finite, 
that Gm ~ 0. And when a,, ¥ 0, (2) treated as an algebraic equation in S,,., 
has a solution 


(7) Snim = S(Sn+m—-1 oes S,, n) 


which vanishes at the point Sz,n. = 0,---,S, = 0,n = ». That is, the 
expansion of f in terms of 1/n, S,,---, Snim-1 does not contain a con- 
stant term. 

We show that equation (7) defines sequences that approach zero, from which 
it follows that equation (2) defines sequences which approach zero. Replacing 
Snim in ® by f, we have (7) in the form 


1 
(8) Sain = an [ao Sn + a Sait +... + Am—1Sn+m—1 
+ 4,(f, Snim-t, salad , Sn, n) a a_n']; 


where the terms in the expansion of #, are of higher than first degree in 1/n, 
Sn,-+++,Sn4m1. Upon setting w(n) = Sr, ue(n) = Snyi,---, Um(n) = 
Snim-1, We obtain from the equation (8) the system of difference equations 


u(n +1) = uiyr(n) (¢ =1,2,.--,m-—1) 


(9) Um(n +1) = -+ [agus (n) + +++ + Gm—1Um (n) 


+ @i(f, Um (n), -++, us (n),n) + aan’). 
The auxiliary equation of this system, 
—p i 68 8 
o -—p 1 0 
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is equivalent to the characteristic equation of the equation of the linear terms, 
viz. 


(10) Amp” + Gap” +--- +a=0. 


When the roots of (10) are distinct, we transform the system (9) into the 


eanonical form 
(11) 2i(n + 1) = pizi(n) + W,(1/n, a(n), 6. % » Zm(N)) (a = 1, 2, wax , m) 


and when (10) has equal roots, into one of the same form, except that when 
a; = 4; the j” equation is 


z;(n + 1) _ pjz;(") + z;-1(n) + W ,(1/n, a(n), oo. , &m(n)). 


The ¥; (¢ = 1, --- ,m) satisfy the condition (5), since the terms in their ex- 
pansions, except the one a_1/n, are of higher than first degree in the variables 
1/n, a(n), --- ,2m(n). Therefore, there are sequences z;(n) which approach 
zero as n — ©, and consequently, sequences {S,} defined by (7), that approach 
zero. Q.E.D. 
It follows from the proof of Horn’s theorem that an algebraic difference equa- 
tion (linear or non-linear) cannot have more than one solution that approaches 
zero as n — © if the roots of the characteristic equation of the difference equa- 
tion of the linear terms are greater than one in absolute value. Hence if there 
is only one solution of the difference equation of the linear terms that approaches 
zero as n becomes infinite there is at most one solution of the non-linear differ- 
ence equation that approaches zero. However, if there are an infinite number 
of solutions of the difference equation of the linear terms which approach zero 
and if the roots of the characteristic equation are not equal to one in absolute 
value, then there are an infinite number of sequences of (7) that approach zero. 
Of course, it is clear that if a sequence is to approach zero the initial conditions 
Sno+i (¢ = 0,---,m — 1), which determine it, must be chosen sufficiently 
small. Moreover, if only one of the roots of the characteristic equation (10) is less 
than one in absolute value, then only one of these initial conditions is arbitrary. 
If two roots are less than one in absolute value then two conditions are arbitrary, 
etc. If all the roots are less than one in absolute value, then all sequences 
defined by (7) converge to zero, for an arbitrary choice of initial conditions in 
the region | S94; | S h/2, (¢ = 1, ---,m— 1). We call such a region a region 
of strong convergence. 

When there is only one sequence defined by a non-linear difference equation 
that has a zero limit, it may be the trivial one S, = 0 for all n. In fact, it is 
always such a sequence unless the equation has a term of zero™ degree in the 
arguments S,, Snai,---, Snzm. In the future we exclude such sequences. 

The hypotheses of Theorem 1 exclude all the difference equations which do 
not contain both the linear terms apS, and GmSnim, (do ¥ 0, dm #0). We 


have the following result concerning those in which a = 0. 
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TueoreM 2. If in a non-linear algebraic difference equation a coefficient of a 
linear term is of higher degree in n than the coefficients of the non-linear terms, and 
all solutions of the difference equation of the linear terms approach zero as n becomes 
infinite, then there are sequences defined by the given equation that approach zero 
asn— ©, 

Proor. When all solutions of the difference equation of the linear terms 
approach zero all roots of the characteristic equation must be less than one in 
absolute value, hence adm * 0. Some or all of the other a;’s may be zero. If 
a ~ 0, this theorem is just a special case of Theorem 1: hence we assume that 
a = 0. That is, we assume that the difference equation has the form 


Galblave + Gn-thasn-2 + ~~ + 4S n+k + o1(Sn4m ett?» Sa ’ n) 


12 
a +am'=0, k>0. 
We replace S,4m in ®; by one of its values 

Sato = S(Sn+m-1 nr ae Sn ’ n), 


which vanishes at the point 1/n = 0, Snima+ = 0,--- , S, = 0, and then set 
u(n) = Spr,--+,Um(n) = Snim1. This gives the system of equations 


us (n + 1) = wi4i(n) (¢ +1, 2,.--.m-—1) 


(13) um (n +1) = — = [axtnss (n) he xis te teal 


+ % (f, U1 (n), 7+ Um (n), n) + an] 


When the non-zero roots of the characteristic equation of the equation of the 
linear terms of (12) are distinct, we apply the linear transformation 
ui(n) = 2,(n) (¢ = 1, 2,--- ,k) 
(14) wim) = Gingi2egi(M) + Gi,nzoteze(m) +--+ + Gi,me%m(N) 
(¢=k+1,---,m) 


where the a;; (7, 7 = k + 1, ---,m) are so determined that the transformed 
system of difference equations has the form 

(15) zi(m + 1) = 2i41(n) (@ = 1,2,---,k) 
zi(n + 1) = pizi(n) + Vi(a(n), oe Zm(N), n) (2 = k + 1, 9 lel , Mm). 


In order to clarify the proof we divide it into two parts. First we assume 
that the V; do not involve n explicitly. Under this assumption, for a given 
6 > 0, however small, it is possible to choose h so small that 


| Wi(ai(m), --- , 2m(n)) | < d{la| +--- + |em]} (¢ =k +1, ---,m) 


when |z;| S$ h(i = 1,2,---,m). Therefore 
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sln +1) | < | poll eet) | + Sf] 2n(m) | + +++ + Lem(n) |} S (pe | maya 
(¢=k+1,---,m). 


If |p;| 2 lol, G@ # 1), we set | p;| + mé = o and choose 6 so small that 


o, < 1. Then, 
lz(n+1)|<oh<h (@=k+1,---,m). 
It follows from the equations (15) that 
|e(n+1)| Sh (§ = 1,2,.--,k), 
and 
| x(n + 2) | < oyh, 
hence 
| z:(n + 2)| < oh (¢ = k, ---,m) 
|z(n + 2)|<h (¢ = 1, 2, ,k — 1), 
and so on until 
|z(n +k +1)| < oh (¢ = 1,---,m). 
In general 
| zi(n + lk +1)| < oth (¢ = 1,---,m). 


Hence the z;(n + lk) (¢ = 1, --- , m) approach zero as | becomes infinite, conse- 
quently, there are sequences of (12) which approach zero as n becomes infinite. 
Now if n appears explicitly in the V; , for a given 6 > 0, we may choose h so 


small and m so large that 
(16) |Wi| < (| a(n)| +--- +|2m|) + M/n (¢ =k+1,---,m) 
for |z:| <A (i = 1,---,m) and” = m. Hence for the initial conditions 
| zi(mo) | < h, 
| 2i(no + 1) | < | pi || 2s(mo) | + (| 21(mo) | + --- + | 2m(mo) |) + M/no 
< (lpi| + mi)h+M/no ((©=kK+1,---,m). 


If |p;| = | p:| for i ¥ 7, we set | p;| + mé = o; and choose 6 so small that 
o, <1. After m and h have been determined so that (16) is satisfied, we in- 
crease M until 0, + M/hno = o2 < 1. Then 


| zi(m + 1)| < och + M/m < och <h (@=k+1,---,m) 
and 


|2i(m +k +1)| < oh + M/m < ooh (¢ = 1,-.--,m). 
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The z;’s may never exceed h in absolute value, and when 1 is so large that 


M 
nase *™ 


— es 


we have 
| 2: (mo + 2k +1+1)| <lpille| +o(la| +--+ + lem) 


+" <oih (i =1,...,m), 
Continuing step by step we see that the z; (¢ = 1, --- , m) are bounded by an 
expression which slowly approaches zero as n becomes infinite: hence they 
approach zero. 
When the characteristic equation has equal roots, we employ a transforma- 
tion (14) in which the a;; (7, 7 = k + 1, ---,m) are so determined that when 
pi = pi-1 the equation of the transformed system is of the form 


(17) zi(n a 1) = pizi(n) a €2;_-1(”) a W (21 92> °** y hm, n), 


where ¢ is chosen so small that | p;| + «€ < 1, |p;| 2 | pil, 7 ¥ Jj, and the 
remaining equations are of the form (11). Treating this new system by a 
method analogous to the one used above, in which | p;| + € plays the role 
corresponding to that of | p; |, we see that it has solutions which approach zero. 
Hence our proof is complete. 

Theorems 1 and 2 exclude difference equations in which the degree of n in the 
coefficient of the non-linear terms is higher than the degree of n in the coeffi- 
cients of the linear terms. In regard to such equations we have the following 
theorem. 

THEOREM 3. An algebraic difference equation, whose terms are of at least first 
degree in the Sy: , defines sequences which approach zero as n becomes infinite, 
if all roots of the characteristic equation of the equation of the linear terms are less 
than one in absolute value 

Proor. As in the proof of Theorems 1 and 2, we replace S,4m by one of its 
values that vanishes at the point S, = 0, --- , Sn4m—1 = 0, and then we apply 
a linear transformation which transforms this new equation into a system of 
equations of the form (11), (15) or (17). 

CasE I. The coefficients of the linear terms in the S,4; are constants. 

When all roots of the characteristic equation are distinct and different from 
zero, the system has the form 


zi(n + 1) = piz;(2) + Wi(z(n), pha te » 2m(N), n) 
where all terms of the ¥; (¢ = 1, --- , m) are of at least second degree in the zi. 
Hence there exists an h such that 
| Wi(ti,--+,2m,n)| < Mn'([ai| +--- +l em{)? (¢ = 1,---,m) 


when | 2;| < h (¢ = 1, --- ,m), where M and lare constants. If | p;| 2 | pi | 
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< (| p;| + Mnim?o”™)o™ < «7 (¢=k+1,---,m) 
|zi(m0 + 1)| So” (¢ = 1,2,---,k): 
|2:(mo + 2) | S | pj || 2: | + M(mo + 1)'m70?"? 

S 0"(| p;| + Mm*(um)'o™) S 0" (¢ = k, +--+, m) 
|2i(m + 2)| So” (¢ = 1,2,---,k — 1): 
and 


| 2:(no +k + 1) ls | p; || z: | + M(m 7 k)'m 2 gtineth) 
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for all 7, we choose a number o such that | p;| < o < 1, and then choose no 
so large that 
| m*Mnoo" | + | p;| So 
Such a choice of no is possible since lim n'a" = 0, if |a| <1. Now for 2, in 
n->o 
the region |z;| S A So" (¢ = 1, --- , m) 
| 2:(mo + 1) | S | ps || 28] + Mngm'o’”® 


(| pi | + Mnim’c")o”® <s oot! (¢ = : 2, int , m) 


| zi(no + 2) | S | ps |lze(mo + 1) | + M(mo + 1)'m*o%"0*? 


lA 


S (| pi | + M(mo + 1)'mio"*)g"9" 
< (J pi| + Mnim?s™) ot! < gt? (i = 1,---,m), 
and in general 
| 2e(mo + p)| < 0"? (¢ = 1,2, ---, m). 


Hence the z;(m) + p) approach zero as p becomes infinite, and consequently 
{S,} ~Oasn —> &, 

When some of the p; equal zero the transformed system of equations is of the 
form (15). Again we choose o such that | p;| < o < 1, but now we take m 
so large that 


mM (uno)'s”® + | p;| S o, where u is the maximum of k and 2. 
Then for | z;:(mo) | S o”° (¢ = 1, --- , m) 


| 2i(mo + 1) | < | ps || 2: | + Mnim 2 52n0 





S (| ;| + Mn*(um)'o")o"® = 0 (i = 1,---,m). 


Consequently, 


| 2:(no + k 4 2) | < |p; || 2 | + M(no ry k fe 1)!m2q*noteth) < gn" 
G@=k+1,---,m) 
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| 2:(mo + 2k + 1)| So” (§ = 1,..., m), 
Continuing step by step we have 
| 2:(mo + pk + 1)| S$ 0"? (§ = 1,.-.,m). 
Hence the z;(m + pk + 1) approach zero as p — ©, and the corresponding 
sequence {S,} ~Oasn— &, 
As in the proof of Theorem 2, the above treatment, with suitable modifica- 
tions, applies to the case where the characteristic equation has multiple roots. 
Case II. The coefficients of the linear terms in S,4; are polynomials in n,. 
The transformed system of equations may be written in the form 
zi(n + 1) = Ziy1(n) (§ = 1,..-,&) 
(18) 2(n + 1) = pizi(n) + ba(l/n)a(n) + --+ + Bim(1/n)em(n) 
+ WV (2i(n), oe » Zm(2), n) (7 =k + 1, a hao m), 
where the b;; are convergent series in 1/n, whose constant terms are zero, and 


all terms of the VW; (¢ = k + 1, ---,m) are of at least second degree in the 
arguments z; (1 = 1,---,m). We first choose ; so large that 


|p;| + mb =p <1 


where b = | 6;;(1/n) | (7,7 = 1,--- ,m) forn > m. Then the argument in 
case I holds where | p; | is replaced by p and m is chosen greater than n,. Hence 
our proof is complete. 


2. Sequences with finite limits. The only possible finite limits for the se- 
quences defined by an algebraic difference equation, with constant coefficients, 


(19’) B(Spim, Sntm—1, +++, Sn) = 0 

are the roots of the algebraic equation 

(20’) @(S, S,---,S) =0, 

where the latter is obtained from the former by replacing S,4; (¢ = 1, --- , m) 


by S. Likewise, the only possible finite limits for the sequences defined by an 
algebraic difference equation 


(19) ®i(Srim,--:,Sn,n) = 0 

are the roots of the equation 

(20) (S,---, 8) = 0, 

where ®; = lim n? #,(Snim,---,S,, m) and q is the highest power of n 


appearing in %. 
If S = ais a root of the equation (20), and if the difference equation 


(21) B(Srim,--+, Sn, n) = 0, 
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obtained from (19) by the transformation S, = S, + a, satisfies the hypothesis 
of theorems 1, 2 or 3, then (19) defines sequences which approach a. For the 
sake of brevity, we shall call the difference equation (21) the “transformed 
equation.” 

TuroreM 4. If a is an algebraic number, then there exist algebraic difference 


equations that define sequences which approach a.° 
To construct one: it is sufficient to take an algebraic difference equation, 


involving arbitrary parameters in such a manner that the equation (20) has a 
root a, and then to choose values for the parameters so that there are sequences 


defined by the transformed equation (21) which approach zero. 
To illustrate the method, we construct a difference equation of second order 


that defines sequences which approach B', where @ is a rational number. The 
difference equation 


(22) ASnSny1 + bSni1Sn42 — CSn41 — (6 +a — c)B = 0 


may define sequences which approach 6° or —8', since its limit as n > @ is 
s° =. Applying the transformation S, = S, + 8, we obtain for the equation 
of the linear terms of the transformed equation 


(23) bSny2 + (b + a — 2c)Si41 + aS, = 0. 


If we choose a, b and c such that the roots of the characteristic equation of (23) 
are less than one in absolute value, there are sequences of (22) which converge 
to 6. For example when a = 1, b = 6, c = 6, the roots of the characteristic 
equation (23) are 1/2 and 1/3, hence some sequences of (21) will converge to £'. 
In particular, the difference equation 


(24) SrSn4 + 6Sn4iSnse = 6S8i41 > 2 = 0 


defines sequences which approach +/2. For the initial values Sp = 1, S; = 2 
and So = 1.4, S; = 1.4 the sequences are 


1, 2, 7/6, 9/7, 1021/756, --- ; 
and 
1.4, 1.4, 59/42 = 1.4044, 34908/24780 
= 1.4087+, 4804998967 /3395410200 = 1.4122+, .-- 


respectively. If instead of the above values we take a = 1, b = 54, c = 35, 
8 = 2, we obtain the difference equation 


(25) 54S nyiSng2 — 35S%41 + SnysSn — 40 = 0. 


* The truth of this theorem follows immediately from the work of Schréder (4) on the 
approximation of the roots of an algebraic equation by the iteration of rational functions. 
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For the initial conditions Sp = 1, S; = 2and So = 1.4, S; = 1.4 the sequences are 
1, 2, 23/38, 30647/22356 = 1.326+, 1.4052+, --- 


and 
1.4, 1.4, 1333/985 = 1.4105+, 48074279/34011495 = 1.41344, ... 


respectively. 

The sequences defined by (25) converge more rapidly to the +/2 than the 
corresponding sequences defined by (24). The question arises, is there a differ- 
ence equation whose sequences approach a given algebraic number more rapidly 
than those defined by all other difference equations? Under the conditions for 
which the generalized Poincare theorem” holds, the smaller the absolute values 
of the roots—all greater than zero—of the characteristic equation of the equation 
of the linear terms the more rapid the convergence to zero of the solution of 
the difference equation. Hence we were led to predict the following theorem. 

THreoreM 5. Of the family of non-linear difference equations of the form 


Am(N) Sn4m + Om—1(N) Sn4-m—1 + i + a(n) Sp 
+ P(Srim, i S, ’ n) ‘aa A(n); 


where ® is a fixed polynomial whose terms are of at least second degree in the Sn+ , 
the a(n) (¢ = 0, 1, --- , m) and 6(n) are polynomials in n, n(n) 4 0, and the 
coefficients of dm(n) are fixed while those of the a;(n) (¢ = 0,1, --- ,m — 1) and 
6(n) are arbitrary; the one that defines sequences which converge most rapidly to 
zero, for arbitrary initial conditions from a sufficiently small region surrounding 
the origin, is the one in which the ai(n), (¢ = 0, 1,---,m — 1) and @(n) are 
identically equal to zero. 

Proor. (a) First, 0(m) must vanish. When 6(n) ¥ 0 the sequences cannot 
approach zero more rapidly than n“, where k is an integer. For if this were 
true, the left member of (26) would approach zero more rapidly than the right 
member which is impossible. The proof of Theorem 3 reveals that there are 
equations of form (26) which define sequences that approach zero as rapidly 
as Mn'c”. Therefore if the sequences have the maximum rate of convergence, 
O(n) = 0. 

(b) Second, the degree of n in a,(n) must be greater than or equal to the 
degree of n in all the other a’s. Otherwise, there is no region of strong con- 
vergence. 

(c) Third, if the coefficients of the terms of highest degree in n of ao(n), --- ; 
Am(N) are Go, G10, --- ,4mo, respectively, then the difference equation of the 
family whose sequences converge most rapidly to zero is the one for which 
Amo ~ O and aw = 0, (¢ = 0,1, ---,m — 1). 

To prove this we refer back to the notation and the statements in the proof 
of Theorem 3. When there is one aj that is different from zero there is at least 
one p; that is not equal to zero. It is only necessary to consider the case where 
all of the p; are less than one in absolute value. For it is possible to show, by 
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a slight alteration of the inequalities in the proof of Theorem 3, that of the 
sequences defined by two difference equations, identical except for the p,; of 
maximum absolute value, those defined by the difference equation with the 
smaller p; converge more rapidly than some of those defined by the other equation: 

So we assume that 1 > | p; | > | p:| (¢ ¥ 7), and consider the equations (18). 


We choose no so large that 
mM (un)'o" < 4(| p;|/3)"™ 


<lal iat (|x| | or |3 1,7 = 1, +++, m) 
< &/2 when pi = pi-1,; 


where ¢€ is the e of equation (17); 
m | bi;(n) | S mb S 3(| p;|/3)""; 
pi + mb = p <1; 


and 
mM(un)'o” +p<o<1, nem. 


When the initial conditions are | z;(m)| < o”°, the last two conditions are 
sufficient to insure that the sequences approach zero such that 


| 2:(mo + pk + 1)| < 0, 


When z;(m) = o”° and | 2:(m) | S o”°, (¢ = 1, 2, --- , m) we have in succession: 


ema + 1) | & |r) |p| — © zene) | mind'| © | ee(me) ] 


IV 


r= 


| z:(mo) | [les] — (10; |/3)"*"] = 2;(n0) (2| p; |/3), 


IV 


| 2;(mo + 2) | = | 2;(m + 1) | |p; | 
— b>) | 2-(mo + 1) | — M(t + 1)'[D | 2-(r + 1) |]? 











1. |\m+l 
= Laine + Dif orl — (122!) ] & Lesem + 01.72), 
\2i(ro + 1) | & | 2,(n0) ie; , ( = 2,3, ---,m+1) 


Under the above restrictions, 


| zen) | 2| ;|\™ 
ey = (22!) ty 


IV 
S 
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Thus, 

| z;(mo + m + 2) | 

| z;(mo + m + Dif le — {b — M(m +m + 1)" D | 2-(t + m + 1)|] 


2 Lee(%m + m + H 
| 2;(mo + m + 1)| 


IV 





= |z;(no0 + m + 1)| [ pi| — (| p;|/3)"™ GoM 


> | z;(mo +m + 1) ((? al) 


IV 


3 


IV 


2;(no) (2| p;|/3)”™*. 
Continuing, we obtain 
| 2i(mo + m + 1) | 2 2;(m0)(2| p;|/3)"" = 0,1,---. 


Therefore, if some p; is different from zero, some sequences defined by the 
difference equation converge to zero more slowly than the sequence (2 | p; |/3)”. 
While if p;5 = 0, ¢ = 1, 2,---,m, all sequences, with initial conditions 
| z:(mo) | S o”°, converge to zero more rapidly than the sequence (| p;|/3). 
Thus for the most rapid convergence of all sequences, with arbitrary initial 
conditions, ai = 0 (¢ = 0, 1,---,m — 1). 

(d) Fourth, the a;(1/n) = 0 (¢ = 0,1, ---,m— 1). For assume this state- 
ment is false and set S, = n*"7', , where d is so determined that in the new 
equation one other linear term is of the same degree in n as d»(n). The quan- 
tity \ will be negative. Remove the common factor n>”. Now, although this 
new equation is not an algebraic equation, the inequalities are stronger than 
they were for the proof of (c) above. Therefore the argument of (c) applies. 
Hence by a repetition of the argument if the sequences have the maximum rate 
of convergence a;(1/n) = 0 (¢ = 0, 1,---,m — 1). Q.E.D. 

Let us again consider the example. According to Theorem 5 there is a 
difference equation of the form (22) that defines sequences which converge more 
rapidly to +/2 than those defined by (25). It is the one for which a = 9, 
b+a-—2c=0. That is, the equation 


28n41Sn42 —a Shit -—-2= 0, 
or 


(27) 28,Seaun —- S —2=0. 


For the initial values S) = 1 and So = 1.4 the sequences are 


* * 
1, 1.5, 1.4166+, 577/408 = 1.414215+, 665857 /470832 = 1.414213562376-+ --- 
886731088897 /627013566048 = 1.4142135623730950 .-. +, --- 
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an 


* * 
7/5 = 1.4, 99/70 = 1.41428+, 19601/13860 = 1.414213564, 
* 
768398401 /543338720 = 1.4142135623730950499 + 


respectively. The asterisk indicates the first incorrect decimal of the terms. 
The sequences defined by this equation converge to the limit very rapidly. If 
we assume the error of the n™ term is « and set S, = 2! + e, then 


Sai = 2 +2? —2°F +... 
Hence, when S, is sufficiently close to the limit the error is approximately 


squared with each successive term of the sequence. We say that the order of 
rate of convergence of the sequences is 2. 

The difference equation (27) is of first order, and we know from the work of 
Schréder on iteration of rational functions that it is possible to set up a first 
order difference equation with constant coefficients, whose sequences approach a 
given algebraic number a with an order of rate of convergence k, where k is any 
integer. That is, there exists a first order difference equation with the property 


that when S, is replaced by a + e, then 
Sn =atae toe +.-., 


where c; is a constant independent of «. (If k = 1, |a| < 1). 

Is this true for difference equations of higher than first order? 

THEOREM 6. For a given integer k and an algebraic number a, there exist alge- 
braic difference equations of the m‘ order that define rational sequences which 
approach a with an order of rate of convergence k. 

Proor. Let 


(28) Snim = B(Sn, +++, Sntm—-, 1) 
be an algebraic difference equation, where ®, a rational function in its arguments 
Si, +++, Sn4m—1, 2, has the Taylor’s expansion 

© = a + a(S, — a) + an(Sny1 — @) +--+ + Qima(Sntma — @) 


(29) } 
+ ar0(Sn — a)” + de0(Sn4i1 — @)(Sn — a) +--- 


which converges in some neighborhood of the point S, = a, --- , Snim—1 = @, 
n= 0, (The a’s are rational functions of n.) 
There are sequences defined by (28) which converge to a with an order of 


rate of convergence k, if, when S, = a + €, Say = a + af, --:: fs 
a+ a gaits. where the 6; (¢ = 1, --- , m — 1) are constants independent of e, 
then Spam = a@ + Ome” + cgeh"t! + ..... This is true, when a = a and 
%i,,...,;, = O for all values of 1 for which 


(30) gt 4 £9 4. thet ck (4; , t2, -++ 4 < m) 
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and when ayi,,...,:, # 0 for some values of i; , --- , 7:such that k* + kh +... 4 
k'' =k". Hence, if there exists a rational function ® such that ®(a, --- , a,n) = 
a and 


; Arvoets, 
giitee Im @ 


97! AS22,1 +++ ASe%m—a la 


Il 





0 
for all 7’s such that 
ju + hijn + js + ee +R im < ke" 


then the theorem is true. 
We state that if a is a simple root of an algebraic equation f(z) = 0, then 


(Sn, Sasi, -°*, Sntm—1, 7) 
1 matt fS)( 1 at Ks 
- 3[s.+ X { sien . (7i55 =) Fis f 4 (S.)oo(s.,1)] 
+ 1T Sen + a8 {(— 02 Bee ( hal. ) — 


j=l i! \F' (Sax) dSnusJ f"(Saus) 
+ Pr" "Giadbs (Sen : ‘)| 


| ef 
t+ .ee + B | Surm- + > x 1) i! S' (Sn4m—1) GSnim- 











is a function’ satisfying the above conditions, where the ¢; (i = 0,1, --- , m — 1) 
are arbitrary polynomials which are finite at the point Snr,; = a (¢ = 
1,---,m — 1) andn = ~@, f’ is the first derivative of f with respect to its 
variable, f’ is the j*" power of f, and ((1/f’(z)) d/dz)’ is a symbolic operator 
which subjects 1/f’(z) to the operation of differentiation and multiplication j 
times. For example 


(jae) p ~ pac paetp aelp ap) b) 


First, ®(a, a, ---,a,) = a, for when Sry; = a, (¢ = 1,---,m — 1), all 
terms, except the first in each of the brackets, contain a factor f(a) and hence 
vanish. Second, the k” — 1 partial derivatives with respect to S, vanish at 
the point S,4; = a, (¢ = 1, --- ,m — 1), since the derivatives of the expressions 
in the last m — 1 brackets are zero and Schréder has proved that the k” — 1 
derivatives of the expression in the first bracket vanish when S, = a. Simi- 
larly, the k"* — 1 partial derivatives of with respect to S,4; vanish when 





’ This is a generalization of a formula developed by Schroder. 
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And lastly, all second partial derivatives with respect to two different 


Sn4t = a. 
S,4i Vanish identically. Q.E.D. 
Assumption. In order to assure that the sequences under consideration are 


rational we shall assume, in the future, that S,,,, enters the equation only in 


the first degree. 


3. The effect of the order of a difference equation and the degree of n in the 
coefficients upon the order of rate of convergence of the sequences. It follows 
from the work of Schréder, and from the above theorem, that it is possible to 
increase the order of rate of convergence of sequences, defined by difference 
equations, by increasing the degree of the equations. Now we ask the ques- 
tions: When the degree is held constant, how does a change in the order of the 
difference equation affect the order of the convergence of the sequences? What 
is the relation between the degree of n in the coefficients of a difference equation 
and the order of convergence of the sequences? 

As a partial answer to the last question we have the following: 

TurorEeM 7. The order of rate of convergence, r, of rational sequences defined 
by a first order difference equation whose coefficients are polynomials in n, cannot 
exceed the order of convergence of rational sequences defined by some first order 
difference equation of the same degree with constant coefficients, provided r = 2. 

Proor. Again without loss in generality, we consider only sequences which 
approach zero. Thus reducing our problem to that of showing that, “if S, is a 
null sequence defined by a difference equation of order one and degree q, then 
the order of rate of convergence r of S, is <q + 1 and it is equal to q for some 
equations of this type which have constant coefficients.” 

When the coefficients are constants, the maximum rate of convergence of the 
sequences defined by first order difference equations of the g‘" degree is qg. If 
the sequences do converge to zero with this maximum rate of convergence then 
the Taylor’s expansion of the left member of the equation 


Siz = &(S,), 
gives 
Srq + al agSn + A483" + ak 


where ag ¥ 0. If the coefficients of & were polynomials in n then the a; would 
be rational functions of n. In order for the sequences of such an equation to 
have the maximum rate of convergence a; = 0, (¢ = 1, --- ,q — 1); a,(n) £9, 
and a,(n) must be a rational function of n in which the degree of the numerator 
is less than the degree of the denominator, viz. 


a(n) = ayn™ + ayn? 4+ --- (A > 0), 
and the equation may be written in the form 


(31) Sata = m*SA[(ay + aay) + +++) + nragyr(n)Sn + magy2(m) Sp + ++ -1. 
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Hence if the order of rate of convergence of the sequences may be increased by 
allowing the coefficients to be polynomials in n, the order of convergence of the 
sequences of (31) must be greater than that of the sequences defined by the 
equation 


Sn4i = a,S% . 


That is, the order of rate of convergence must equal or exceed the order of con- 
vergence of the sequences of 


(32) Ta = aT”. 


Now when the sequences converge with an order of rate 2 2, we may choose mn 
so large and then S,, so small that 


ay" | ayyin™ + anyon” +---| < €/2 < }, n> mM, 
Max’ | dg4i(N)Sn + Agy2(n)S, +++» |< e/2 <4, n>, 


with such a choice the sequences defined by (31) are term by term greater in 
absolute value than the sequences defined by the difference equation 


(31’) Sr = nal — &) 8%. 
The solution of this equation for 8,, = Sn, is 
| on - S? af? ae dp 4. te ~ 1)"(p +4: Nm — >} ied a (mm) 


and the solution of (32) is 


— _(atl)?—)D/¢ a(atl? 
T'not+p = A aS . 


But 


Tn Pp. 
a ( ee = lim g.)*" “Hg letra lary hol or "'%, + —1) 


po \Snoip) pre 


(p+ + 2)™ +++ (mo) = 0. 


Therefore the sequences of (31’), and consequently those of (31), do not converge 
to zero as rapidly as those of (32), so they cannot have an order of rate of con- 
vergence q + 1. Q.E.D. 

Although we have been unable to generalize Theorem 7 we should like to 
conjecture that it holds for m“” order equations. 

Now let us concentrate upon the relation between che order of difference 
equations and the order of the rate of convergence of their sequences. 

TxroreM 8. There exist algebraic difference equations of higher than first order, 
that define rational sequences which converge to certain algebraic numbers with a 
higher order of convergence than the sequences defined by any first order difference 
equations of the same degree. 

Proor. Take the general equation of the m™ order and g™ degree with con- 
stant coefficients, 


q m—1 m—1 
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or 
q m—1 m—1 m—1 
2, z i, a) > : (Coisig---é; Snti: Sntig ving Sn+i;) + Co 
€ — = = etj- 
(33) Sn4m ‘gieiend q m—1 m—1 m—1 —_—_— 
> x > eee , x (Cos ég---¢; Sngsy Sati, eee Snsi;) ca Com 
cm tare Seat i-124j-2 


and demand that it define sequences which approach the number 6”? with an 
order of rate of convergence k, where k is an integer to be determined, 8 is a 
rational number, and §”" is irrational (j = 1,---,q— 1). That is, set S,4; = 
B+ g,e" (j = 0,1, ---,m — 1), expand the right member of (33) in powers 
of «and demand that the constant term equal 6“, and that the coefficients of ¢' 
vanish fort = 1,2, ---,k" —1. These demands impose k” linear homogeneous 
relations upon the coefficients of the given equation (33); as can readily be seen 
by dividing the numerator by the denominator. The k” ijinear relations may 
contain some of the 6; (¢ = 0, --- ,m — 1) and some fractional powers of 8. 
In order to insure that the coefficients of the difference equation (33) be rational, 
the coefficients of 6; and the fractional power of 8 in these linear relations must 
vanish. 

The maximum value for k is determined by the number of the relations that 
the coefficients of (33) may satisfy, that is, by the number of coefficients. This 


y . 
number’ is 





yy mn a ee Ny. 


q 
Since, when m = 1 it is >> [2] + 1 = 2q + 1, the coefficients of a first order 
j=1 


difference equation may be chosen so as to satisfy at least 2¢ linear homogeneous 
relations. Therefore k may be at least 2, for when k = 2 the coefficients are 
only restricted by 2q relations, q relations which insure that the constant term 
in the power series in ¢ equal 6’/“ and q relations which insure that the coefficient 
of eequal zero. Hence if a difference equation of higher than first order defines 
sequences whose order of rate of convergence, k, is greater than that of sequences 
defined by a first order difference equation, k must be at least 3. 





*In order to see this, write 


m—1 m m—1 m—1 m—1 m—1 
Ee e-Ee Y+T ¥ 
4:=0 t;2tj-1 t,;=0 ij2tj-1 i;=0 tj-12%tj-2 


The number of terms in the first sum on the right is m(m + 1) --- (m+ j —1)/j!. There- 
fore, the total number of terms of j** degree is 
m(m +1) +--+ (m+j —1) 4 mm +1) -++ (m+ j — 2) 

j! (j -—1)! 





m(m +1) --- (m+ 7 — 2)(m + 27 — 1) 
j! ; 
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Hence if the order of rate of convergence of the sequences may be increased by 
allowing the coefficients to be polynomials in n, the order of convergence of the 
sequences of (31) must be greater than that of the sequences defined by the 
equation 


Sri = a,S% . 


That is, the order of rate of convergence must equal or exceed the order of con- 
vergence of the sequences of 


(32) Tay = TS”. 


Now when the sequences converge with an order of rate 2 2, we may choose nm 
so large and then S,, so small that 


ax | axyin + anyon +---|<e2<}, n>m, 
nay” | Ag4i(n) Sy + dg42(n) Sp sed | < e/2 < 3, n>, 


with such a choice the sequences defined by (31) are term by term greater in 
absolute value than the sequences defined by the difference equation 


(31’) Buu = nay(l — S82. 
The solution of this equation for S,, = Sn, is 
Snore = Sno” (1 — 6) "(p + mo — 1) (p+ m9 — 2)" «+ (m9) 


and the solution of (32) is 


— ,((atl)?—D/@q glatlP 
TV nets —_ ay ed . 


But 


Tn p : 
in ( mi = lim “.j** a? g hat DP—aP) 1474 pi lai + no —1) 


po Snot po 


(p+ mo + 2) «++ (mo) = 0. 


Therefore the sequences of (31’), and consequently those of (31), do not converge 
to zero as rapidly as those of (32), so they cannot have an order of rate of con- 
vergence g + 1. Q.E.D. 

Although we have been unable to generalize Theorem 7 we should like to 
conjecture that it holds for m“” order equations. 

Now let us concentrate upon the relation between the order of difference 
equations and the order of the rate of convergence of their sequences. 

THEorEM 8. There exist algebraic difference equations of higher than first order, 
that define rational sequences which converge to certain algebraic numbers with a 
higher order of convergence than the sequences defined by any first order difference 
equations of the same degree. 

Proor. Take the general equation of the m‘ order and g™ degree with con- 
stant coefficients, 


q m—1 m—1 


p > oe pm cee pm (Cos,iq.--4, Sn4i, Sri eee Sn+i;) + c% = 0, 


7=1 i1=0 ig>%) tj2tj-1 
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or 
q m—-l m—1 m—1 
ee th ll iy ica Bara => ang) 0 
(33) Snim = — Aah tse teh) “i i=! pein 
2d ¢,=0 x, oe As he (coisss---6; Setis Suri a Sn+ij) + Com 


and demand that it define sequences which approach the number 6'* with an 
order of rate of convergence k, where k is an integer to be determined, £ is a 
rational number, and @”"" is irrational (j = 1,---,¢— 1). That is, set S,.; = 
B+ 0,6" (j = 0,1, --- ,m — 1), expand the right member of (33) in powers 
of «and demand that the constant term equal 6", and that the coefficients of ¢ 
vanish fort = 1,2, ---,k" — 1. These demands impose k” linear homogeneous 
relations upon the coefficients of the given equation (33); as can readily be seen 
by dividing the numerator by the denominator. The k” linear relations may 
contain some of the 6; (¢ = 0, --- ,m — 1) and some fractional powers of 8. 
In order to insure that the coefficients of the difference equation (33) be rational, 
the coefficients of 6; and the fractional power of 8 in these linear relations must 
vanish. 

The maximum value for k is determined by the number of the relations that 
the coefficients of (33) may satisfy, that is, by the number of coefficients. This 
number’ is 





fone +1)... (nti— D4 ¥—D | 


j= j 


q 
Since, when m = 1 it is >> [2] + 1 = 2q + 1, the coefficients of a first order 
j=1 

difference equation may be chosen so as to satisfy at least 2¢ linear homogeneous 
relations. Therefore k may be at least 2, for when k = 2 the coefficients are 
only restricted by 2q relations, g relations which insure that the constant term 
in the power series in € equal 6!“ and q relations which insure that the coefficient 
of eequal zero. Hence if a difference equation of higher than first order defines 
sequences whose order of rate of convergence, k, is greater than that of sequences 
defined by a first order difference equation, k must be at least 3. 





*In order to see this, write 


m—1 m m—1 m—1 m—1 m—1 
ote, Cay Ti es. 
#;=0 t;2tj-1 4;=0 ij2tj-1 i,;=0 tj-12%j-2 


The number of terms in the first sum on the right is m(m + 1) --- (m +j — 1)/j!. There- 
fore, the total number of terms of j degree is 
m(m +1) +++ (m+j—1) | m(m+1) --+ (m+j — 2) 


j! * G-1! 





m(m + 1) +--+ (m+j — 2m + 2j — 1) 
j! ; 
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Now consider difference equation of the second order. When m = 2 the 
number of terms in (33) is 


Lj t1ti= Mt) pq41=@ 40" 


If a sequence converges to 6!" with an order of rate of convergence 3, the coefii- 
cients must satisfy 18q relations—gq relations to make the first term of the series 
in ¢ equal B'", 2g to make the coefficients of « and ¢ vanish; 6¢ to make the 
coefficients of é, é, e& vanish (twice as many here since 6; may be irrational and 
may appear in each of these relations); and 9g to make the coefficients of ¢, 
é and é vanish (6, and 6; may enter in each of these three relations). Since 
there are (q + 1)’ coefficients in the difference equation, if g = 17 these 18% 
relations may be satisfied, for then the number of homogeneous linear equations 
is less than the number of undetermined coefficients. Hence, for g = 17 there 
exists a second order difference equation which defines sequences that approach 
8" with an order of rate of convergence 3. 

Now if we can prove that no first order difference equation of 17th degree 
may define sequences which approach 8” with an order of rate of convergence 
3 our proof is complete. It follows from Theorem 7, that there is no loss in 
generality if we only consider equations with constant coefficients. The general 
first order difference equation of 17th degree is 


17 ’ 
2, aS 
Bugs 2 ene 


16 Ps 

> 3,83, 

j=0 
If this equation defines sequences which converge to 6” with an order of con- 
vergence 3, when we set S, = 6” + ¢ and expand the right member in powers 
of ¢, the constant term will be equal to 6” and the coefficients of ¢ and ¢ equal 
to zero, viz. 


dy + (a1 + bo)BY + (az + by)B + --- + (ay + du)B = 0 
ay + (ay + bi)" + (Bag + 2bs)8" + --- + (17a + 16b.)8" = 0 
dy + (Bas + be)B"” + (6a, + 3bs)B"” + .-- + (136017 + 120b:6)8'"" = 0 


If the a’s and b’s are to have rational values they must satisfy the following 
equations 


a+b =0, a+th=0, ---,a6¢+ bis = 0, ao + (az + bis)8 = 9 
a=0, 2a2+6b,=0, 3a; + 2b, =0,---, 17ay7 + 16bi5 = 0 


a =0, 3a,+b =0, 6a,+ 3b; =0,---, 136017 + 120b.5 = 0, 
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which have the unique solution a; = 0, b; = 0 (¢ = 0,1,---,17) G = 

., 16). Therefore no such equation exists. Q.E.D. 

Next we show that the sequences defined by a first order difference equation 
may converge to an algebraic number more rapidly than those defined by all 
other difference equation of higher order and of the same degree. 

THEroREM 9. The difference equation of second degree that defines sequences 
which converge most rapidly to the irrational number 8° where B ws rational, is the 


one of first order. 
Proor. Consider the general second degree difference equation of the m™ 


order 

(34) Line 2 CriiN  Snzj Sagi + »» Cin Snyi tn ,| = 0. 
When it defines sequences that approach £, 

(385) ) 2 ewi8 + cust + 0 = 


and the transformed equation is 


l 
py n- te > Crij (Sas; Sasi + 8 Sn, + B +344) + > Cri 3] 


r=0 7=0 i>) 
(36) 
+ D0 {= D cni8 + > cast +a = 0 


r=1 7=0 i>] 


If a difference equation of the form (36) defines sequences which converge 
to zero with the maximum rate, then, according to Theorem 5 


—l Mm 


(35') ED cniB + De cas +c, =0 (r =1,2,---,)) 
7=0 i2) 
and 
(37) + etal + c,; =0 (j =0,1,---,m—1)(r =0,1,---,) 
i>) 
where ” 1 i sj 
. ae oe 
Therefore when #? is an irrational number and the coefficients of the difference 
equation are rational numbers, c,; = 0, (7 = 0,1, --- m — 1) (r = 0,1,--- , JD 
And since from (35) and (35’) + # Ci = 0 (r _ 0, 1,--- ’ l), Crm = 0 (r = 0, 1, 
i=0 
«+, OD. 


Now assume our theorem is false, then there is a difference equation of the 
oe (34) of higher than first order that defines sequences which approach 
6 with an order of convergence 2. Writing (36) in the form 
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l 1 m—1 m— 


Le > oe Crij(Sn4jSn4i + B’ Sui, + B* Susi) 





han n" i=0 iz) 
n ong \ —gieag l 1 m—1 ; ’ 
> os } 4 Crmi(Sn4i + B’) 
r= 7= 


9% 


when Sry: = G€ (¢ = 0,1, ---,m— 1) 


om 


we can see that if Snsm = Omé 
“Cri; = 0 (i,j = 0,1,---,m — 2)(r = 0, --- , I) 
Cr,m—2,m—1 = 0 (r si 0, 1, 2, oie ,l). 


These results combined with the equation (37) and (35) give 


Cm =O (f = 0,1,---,m— 2)(r = 0,1,---,]) 

and 
(Cr,m—1,m + Cr,m—1,m—1)B + Cr = 0 (ry = 0,1,---,)) 
2Cr,m—1,m—1 + Crym—izm = O (r = 0,1,---, J). 


Therefore the unique equation of the form (34) which defines sequences that 
approach 6! with an order of convergence 2 is 


y | ie. See _ | aid B = 0 
but this equation is equivalent to the first order difference equation 
2S8naiSn — £ = B = 0 Q.E.D. 


In summarizing we can only say that, in general, there is no direct relation 
between the order of a difference equation and the order of rate of convergence 
of its sequences. 


4. Minimum degree for a first order difference equation whose sequences 
approach 8" with a given order of convergence. 

THEOREM 10. There exists a first order difference equation of q* degree, q = 
kp, that define sequences which converge to B'”, where B is rational and B’” is 
irrational (j = 1, 2,--- , p — 1), with an order of rate of convergence 2k, and no 


first order difference equation of degree kp may define sequences which converge to 


UP 
8°” with an order of convergence greater than 2k. 
Proor. The transformed equation of the first order difference equation of 
th 
q. degree, 


4 ° 
7: >» a; Si, 
7=0 


(38) Sau = 5 ; 
> b; 8%, 
7=0 
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is 
q 
pe a(S, Se pi?) 
(39) Say = = — p”. 
D (8, — BY)’ 
7=0 
If the coefficients a; , b; (j = 0, 1, ---,m) (¢ = 0, 1,---,m — 1) may be 


determined so that the coefficients of Sj, in the numberator of the left member 
of (39) vanish (t = 0, 1, --- , r), then (38) defines sequences which approach 
3” with an order of convergence r + 1. Setting the coefficients of Si, (t = 
0, 1, ---, 7) equal to zero we obtain the following linear homogeneous equation 
in the a’s and b’s: 


1 
(a;8"” + b;8°°""”) + a8"? = 0 


7=0 


q—1 
(40) 2 i(a;B?”” + bj") + ga,8” = 0 


Since by hypothesis 6”” is irrational (j = 1, 2, --- , p — 1), and since the a’s 
and b’s are to be rational each of the equation (40) divide into p equations. 
Therefore, the demand that the sequences approach 8"” with an order of con- 
vergence 2k imposes 2kp linear homogeneous relations upon the 2kp + 1 coeffi- 
cients of (38). Hence there is always a solution. 

Assume that the second part of the theorem is false, then the a’s and b’s will 
satisfy the relations (40) when r = 2k, that is to say, they will satisfy (2k + 
1)p linear homogeneous equations. These (2k + 1)p equations divide into p 
systems of equation. One with 2k + 1 unknowns, 


ay + Bboy + Bay + B’dop-1 + Bary + --- + B dip + Ba, = 0 


am ' 2 2 
(’ ) ai +(f) vuie (rs ) sl +(?) po 


+ P ) bp + (7) ea = 


~~ | 2 1 2 
‘ 2k ) bb. + (z) Bap + ( J ) 6b + (:?) 8 dep + > 
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l 1 m—1 m— 


2 = de Crij(Sn4jSnzi + 8 Snai, + B*Sn4s) 





5 _ _ra0 0" j=0 izi 
nm ( aaa Dy mi ; ’ 
p a = Dy Crm(Sn4i + 8°) 
r=0 7=0 
we can see that if Spin = Ome When Sry; = O:e" (¢ = 0,1,---,m— 1) 
Crij = 0 (i,j = 0,1,---,m— 2)(r =0,--- ,]) 


= 0 (r = 0,1,2,---,)). 


Cr,m—2,m—1 
These results combined with the equation (37) and (35) give 


Crmj = 0 (j = 0,1,---,m — 2)(r = 0,1,---,]) 
and 


pr + ee + Cr = 0 (r = 0, 1, in ,l) 
ee + Cr,m—1,m >= 0 (r = 0, 1, rat , 1). 


Therefore the unique equation of the form (34) which defines sequences that 
approach 6' with an order of convergence 2 is 


28n4+mSn+m—1 seaaad ia = B = 0 
but this equation is equivalent to the first order difference equation 
28.8. — Sa —B=0 Q.E.D. 


In summarizing we can only say that, in general, there is no direct relation 
between the order of a difference equation and the order of rate of convergence 
of its sequences. 


4. Minimum degree for a first order difference equation whose sequences 
approach @”” with a given order of convergence. 

THEOREM 10. There exists a first order difference equation of q™ degree, q = 
kp, that define sequences which converge to B'”, where 8 is rational and p”” is 
irrational (j = 1, 2,--- , p — 1), with an order of rate of convergence 2k, and no 
first order difference equation of degree kp may define sequences which converge to 
8" with an order of convergence greater than 2k. 

Proor. The transformed equation of the first order difference equation of 
q*” degree, 
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is 


q 
pm a(S, — p?)’ 
(39) Baa = - — p’”. 


p> b(S, ie p"?)’ 





If the coefficients a; , b; (j = 0,1, ---,m) (¢ = 0,1,---,m — 1) may be 
determined so that the coefficients of S‘, in the numberator of the left member 
of (39) vanish (¢ = 0, 1, --- , 7), then (38) defines sequences which approach 
3°” with an order of convergence r + 1. Setting the coefficients of 5%, (t = 
0, 1, ---, 7) equal to zero we obtain the following linear homogeneous equation 
in the a’s and b’s: 


q-1 


> (a; gi!” 4 b; gothlr) $- Aq gi? ae 


i=0 


q—1 ; 
(40) 2d jase?” + b;6””) + qa,p*?” =0 
b— 


see eee eee eee eee e eee eee eeeeeeeeeeeseeeeeeeeeeanseees 


Since by hypothesis 6”” is irrational (j7 = 1, 2, --- , p — 1), and since the a’s 
and b’s are to be rational each of the equation (40) divide into p equations. 
Therefore, the demand that the sequences approach 6” with an order of con- 
vergence 2k imposes 2kp linear homogeneous relations upon the 2kp + 1 coeffi- 
cients of (38). Hence there is always a solution. 

Assume that the second part of the theorem is false, then the a’s and b’s will 
satisfy the relations (40) when r = 2k, that is to say, they will satisfy (2k + 
1)p linear homogeneous equations. These (2k + 1)p equations divide into p 
systems of equation. One with 2k + 1 unknowns, 


ay + Bbyp1 + Bay + Bboy + Bary + --- + B dip + Ba, = 0 








(77 ') sy +(?) a0, +(??; + ') 6 Nida +(? P) say + _ 
(41) 2 + ("= tY fbi + (2) sta =0 
 wichdehaaialada eet lcs 
("5") ns + (B) Boe + (2Paf ") tes + (2) tan + 
+ (Mz tY bay + (2) Ba, = 0 
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and (p — 1) of the form 


/, 


, ; : : , 
() b; + (? : ') Aju + ( 7 . Bbj+p + ? . : * ) BGj+p41 


+ eee + ‘4 * wi ail BX bis eae 


(42) 


b; + aj41 + Bi4p + Baspys +--+ +BY bj4G—yp + BGiu-ypy =0 


) k-—1 1 
+ (? + ( i )p + ) bt anieres - 0 





- a 1 . ° 1 
fi) b; + ( a ) dj + ( - Bbi+p + (? * 2%, ) Baj+p+1 


+} «0. > 4 ~ ” Be" bi40—1 


4 (? + (k _ + " a 


(j = 0,1, 2,---,p — 2). Wheng = 0 the last equation of the system may 
not appear but in all cases there will be at least 2k equations in each of the last 
p — 1 systems. Therefore, since all of the systems of equations have at least 
as many equations as unknowns there are no non-trivial solutions unless a 
determinant of one of the systems is zero. The determinants of the coefficients 


are all of the form 
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) C2) C2) C3") C4) CR) 
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Such a determinant is equal to 





) 





1 ] l 1 
, jg jt+l, itp jt+mp+1 
alt os G+ (j + mp + 1)°), 
J G+) G+pD) (j + mp + 1)’ 





which is different from zero, for it is a Vandermonde determinant. Therefore 
we have a contradiction. Q.E.D. 

It is not difficult to see that the equation of q‘® degree, ¢ = kp, which de- 
fines sequences that converge to 6” with an order of convergence 2k is unique. 
The solutions of the first 2k equations of the last p — 1 systems (42) are all zero 
and the solutions of the first 2k equations are of the form c;k. Hence, when the 
factor k is removed from the numerator and the denominator, we obtain the 


equation. 
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THEOREM 11. There is a first order difference equation of degree kp + 1 that 
defines sequences which converge to B''”, where B is rational and B*” is irrational 
(j=1,2,---,p- 1), more rapidly than those defined by any first order difference 
equation of degree q, where (k + l)p>q>kp+l. 

Proor. Consider the difference equation (38) forg = (k + 1)p — 1. First, 
assume that the sequences defined by this equation approach s'” with an order 
of convergence 2k + 2. Then the coefficients satisfy 2k + 2 relations (40). 
These 2k + 2 relations give (2k + 2)p homogeneous linear equations in the 
a’sand b’s. And the linear equations divide into p systems of 2k + 2 equations 
each. One of the systems contains 2k + 1 unknowns and the other p — 1 
systems each contain 2k + 2 unknowns. None of these systems have non- 
trivial solutions for the determinants of the coefficients are composed of columns 
of binomial coefficients which we proved, in the proof of Theorem 10 to be non- 
vanishing. Hence our assumption is false. 

Second, assume that the sequences have an order of rate of convergence 
2k + 1, then the a’s and the b’s must satisfy 2k + 1 relations (40). These 
2k + 1 relations give p systems of homogeneous linear equations (41) and (42). 
One system contains 2k + 1 unknowns and the other p — 1 systems contain 
2k + 2 unknowns each. The solutions of the first system must all be zero, 
while each of the other p — 1 systems may be solved for 2k + 1 of their un- 
knowns in terms of one of the others, for again, all determinants of the coefficients 
are different from zero. 

We construct a matrix of 2(k + 1) columns and (2k + 1)(p — 1) rows, from 
the coefficients of the p — 1 systems of equations, where the coefficients of a,8’, 
i418’, --+ , Qizpr6’ ve in the same column (i = l,p+1,---,kp+ i) and 
the coefficients 6,8’, bi418’, --- , br4p-18’ lie in the same column (I = 0, p, 2p, 
--»,kp). That is, the matrix 


] 1 1 
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Since the binomial coefficients obey the law 


(2)+()-(5) 


any one of the rows of this matrix is linearly dependent upon the first 2k + 1 
rows. Hence the rank of the matrix is 2k + 1. 

This tells us that if we solve the first system of equations for 2k + 1 of the 
unknowns in terms of a; that the second set of equations has the same solutions 
where the subscripts are increased by one, and so on. If we solve p — 1 sys- 
tems of equations in terms Of Gipy1, Gipy2, +--+ » €e41)p—1 Tespectively, then the 
given equation (38) takes the form 


p—l é 
oi(Sn) dX Akpti Ss, 
jam 








Say = = p—l : 
$2(Sn) > ArpriS, 
fo 

or 

Y o1(Sn) 
43 Say = , 
( ) +1 $2(Sn) 
where ¢:(S,) and ¢2(S,) are of degrees kp + 1 and kp respectively. Since the 
equation (43) is unique, our proof is complete. Q.E.D. 


The two previous theorems have dealt only with difference equations with 
constant coefficients. In the following theorem we assume that the coefficients 
are polynomials in n. 

THEOREM 12. There exists some first order difference equations with constant 
coefficients that define rational sequences which converge to Bi” , where 6°” is irra- 
tional (7 = 1, 2, --- ,p — 1) more rapidly than the sequences defined by any first 
order difference equation whose coefficients are polynomials in n. 

Proor. If the difference equation 

l q 
PR n' 2. ai;S', 


(44) ha SS 
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p n' 2. bi S', 
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. ° 1/ . . 
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When | = 0 we know from the previous theorems that there is a maximum order 
\ for the rate of convergence for the sequences. That is, the number of rela- 
tions for the determination of a; and bo; cannot exceed \. The relations (45) 
are similar for all values of <. Hence when l > 0, the number of such relations 
cannot exceed A, without all a,;; and 6;; vanishing. 

When g = kp we know, after Theorem 10 that \ = 2k. Each of these 2k 
relations (45) gives, for each 7, g linear homogeneous equations on the coeffi- 
cients. Since the relations are the same for all 7, when the last system of equa- 
tions are solved in terms of aj, (¢ = 0, 1, --- , 1) (45) reduces to 


l 
$1 (S,,) > n dir 





Sri == I 
2 (Sn) > n' dir 
or 
$1 (Sn) 
Sr = — Ta. 9 
 Ga(Sn) 


where ¢; and ¢2 are polynomials is S, with constant coefficients. 
When g = (k + 1)p — 1, then as in the proof of Theorem 11 for 1 = 0 


p—1 
3 (Sn) > Qokp+j si” 
j= 





Sasi _ p—1 ; 
$4 (Sz) > Aonp+i Sn 
j= 


and since the relations (45) are similar for all 7 
Ll p—l 


3 (S;,) ; a ; & Cikp+i = 


i=0 7=0 
Savi = 1 


lt p— we. 
4 (Sn) DY Do Ginn; Se 


i=0 j=0 





or 
ai eee 
. oy (Sn) ‘ 

where ¢; and ¢4 are polynomials with constant coefficients. Q.E.D. 

ExaMPLe: When k = 1, the first order difference equation of p™ degree whose 
sequences converge most rapidly to B"””, where 8 is rational and 6”” is irrational 
(7 = 1,2,--.,p — 1), is 
(46) PSriiSh — (p — 1)8h — B = 0 
and the one of (p + 1)** degree is 


(47) (p + 1)Snyi82 — (p — 1)S82*" + (p — 1)BSaur — (p + 1)S,8 = 0. 
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The sequences defined by (46) have an order of convergence 2 and those 
defined by (47) have an order of convergence 3. 
It is interesting to note that equation (46) is Newton’s algorithm for the ap- 


proximation of p'”. 


5. Approximation of g’”. In practical work it is often valuable to know an 
upper bound for the error of a given term of a sequence, i.e., an upper bound 
for the difference between a term and the limit of the sequence, or in other 
words, if the terms of the sequence are written in decimal form it is important 
to know a means of asserting that a given term of the sequence is correct to at 
least R decimal places. 

Tureorem 13. If a sequence approaches the real number B'”, such that when 
|S, — BY” | = ethen| Say: — B’” | S Ce’, where C is a constant independent of «, 
(i.e. the sequence has an order of convergence k) and if, € is so small that 
2Ce < «, and if the first i decimal places of the two consecutive terms of the sequence 
are the same, (i > 0); then, the second of the terms cannot differ from B' by as 
much as 2*(10)"“"C. And when C < (10)? the second of the terms is correct to 
ki — 3 — k/3 decimal places. 

Proor. Let e, denote the error of S,. Since C&" < }, 


én < 2e,)1 — C&"| 
= 2\e, — Cé& | < 2||S, — | — | Say" BY? || S 2|S8, — Syas|. 


Now 
| Sn — Snir] < (10), 
hence 
énu S Ce, < C2"(10)™. 
And when 


C < 10’ 
fn < 2*(10)7*' <i ers 


Therefore, S,41 is correct to at least ki — 7 — k/3 decimal places. 

ExamPLe: Consider the sequences defined by (27). For this equation k = 2, 
|C | <1 therefore 7] = 0. For the sequence with initial condition So = 1, Ss 
and SS, differ only in the sixth decimal place, hence S; is correct to 9 decimal 
places. By the same method of calculation S; is correct to 21 decimal places. 

Although it is possible to set up a difference equation whose sequences con- 
verge as rapidly as one desired, there seems to be less calculation in the approxi- 
mation of an irrational number 6”” if the sequences are defined by the simple 
difference equation (46) Newton’s algorithm, which may be written in the form 


(48) ee ge eee 
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We know that the order of rate of convergence of its sequences is 2. Now let 
us determine its region of strong convergence. 
Tuzorem 14. If B’” > 0, the sequences defined by (48) converge to B'” for 


all positive initial values So. Moreover, they approach 8” monotonically when 


the initial values are > B'”. 


Proor. The transformation equation is 


p(Bn41 + 6’) (Sn + B'”)?* — (p — 18, + 8")? — 8 =0 


or 





eatery - J 
nti = Sa} l — - — . 
Ba BS 


For positive values of 8, the expression in the brackets is > 0 and < 1 — 1/p. 
Therefore, the sequences S, decrease monotonically to zero for initial values 
5S, > 0. This is equivalent to the statement that all sequences {S,} of (48) 
determined by initial conditions > 8” converge monotonically to 8”. Hence 
the second part of the theorem is true. 

If0< S, < B’”, then Sasi — Sn = (1/p) {—S, + BSX?} > 0, so that Say: > 
S,. Hence, either the sequence {S,} is monotone increasing tending to a limit 
< p’” or some term of the sequence is greater than 6"? and from this term on the 
sequence decreases monotonically to 6”. In the first case the sequence cannot 
approach a limit K,0 < K < 8"” for the only finite limits for the sequence are 
8’? and —p"” since the limit of the difference equation is S? = 8. Hence our 
proof is complete. Q.E.D. 

Since irrational numbers of the form 6”” are usually accurately computed by 
means of a binomial series, it is interesting to compare the rate of convergence 
of the sequence of partial sums of a binomial series with that of a sequence 
defined by a difference equation. 

THEOREM 15. A sequence of partial sums of a binomial series cannot con- 
verge to 8” with a order of convergence 2. 

Proor. Case I. Positive term series, (1 — 1/a)’ =1L +h tbe +--- + 
b, +--+. Assume that the theorem is false, then the sequence of partial sums 
has an order of rate of convergence 2; i.e. if 


1/ 1 2 


where C is a constant independent of «,. For a rapidly convergent series 
bnti/bn <r <1 forn > mp. The error e, of the partial sum S, is less than 








r . 
by (; im .) and is greater than b,,:. Therefore 


r l 1 r 
w<e(i5) = (0) cox (5) | 
n+1 : 
(, ; h (4) m= Sess < toga | 
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Therefore, if the theorem is false, 
l 1 n+l » o(!) a ( r.) 
n+1 fy nj a"\l-—r 


8-91) cot eee r y. 


n! a"|\1—r 


or 





‘(n+ 1a! 
This last inequality cannot hold for all n, if a > 1. But, since the series is 
convergent, a > 1. Hence we have a contradiction. 
Case IJ. Alternating series. (1 + 1/a)' = 1 — b; + bo — by +... 
Again assume the theorem is false. The error e, is less than 6,4; and is greater 
than Daai — baye. So 


ae ee 
oe <des = (C0 4 1) ao 


| l 1 l 1 | 
(a4 ») ant 7 ae on "T bn4e aaa Dnis < En+1- 


Therefore if the theroem is false 


lf 1 1 l 1 | l re) 
(5 oe = aa" hg Oe q*t 


l—n—2| 


Le eee | 
Seal Rot n+3 <c| l ) al 
| n+2 a? n+1/ art 


This last inequality cannot hold for all n, for the left member approaches a 
constant greater than zero, (a ¥ 1), and the right member approaches zero 
since it is a term of the series. Therefore we have a contradiction. Q.E.D. 

Coro.iary: There are sequences defined by the difference equation (48) which 
converge more rapidly to B'” than the sequence of partial sums of any binomial 
serves. 
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SOME GENERAL APPROXIMATION THEOREMS 
By LawRENceE M. GRAVES 
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1. Introduction. In this paper comparatively simple proofs are given for 
some general theorems on the approximation of a given function by “smoother’’ 
functions. Well known theorems on approximation by polynomials, by trigo- 
nometric sums, and by integral means are special cases. The proofs, given in 
Section 3, are so formulated that there is little more complication in the case 
of functions of m variables than in the case of functions of one variable. 
Theorems on the term-by-term integration and differentiation of the approximat- 
ing sequence are included. In Section 4 certain cases of iteration of the ap- 
proximating transformations are considered. In Section 5 it is shown that 
when the given function is continuous and vanishes on a closed point set £, 
certain types of approximating functions may be required to vanish on a neigh- 
borhood of EF while possessing as many continuous derivatives as may be derived. 

Let A denote the interval 0 S$ 2; S a;, (¢ = 1, --- , m), of m-dimensional 
space, and let B denote the interval —a; S x; S a;. Let {Q,(x)} be an in- 
finite sequence of bounded measurable functions defined on B and let (k,) be 
an infinite sequence of positive numbers. Then if f(x) is a function defined 
and integrable on A, we may set 


Pal;f) = ke | fle) Qu(e — 2) de 
(1:1) rt a 
=k, [ . f f(2)Qn(z — 2) dem» + dar. 


It is observed at once that if the function Q,(z) is of class C“ on B then P, is 
of class C“ on A, and if Q,(x) is a polynomial so is P,. For the most part 
we shall wish to assume that the numbers k, are related to the functions Q, 
by the formula 


(A) 1/k, = / Qn(x) dx = [ eee * Qndtm +--+ da. 


— Om 
Interesting examples of sequences Q,(z) are: 


(i) Q,(z) = [1 — » 2)", with p . ai <1; 


2 


(ii) Q,(x) = II (1 — 23)", with a; <1; 


(iii) Q,(r) = II cos" z;, with a, S 7/2; 


281 
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(iv) Q,(z) =1 for |a;| $1/n, Q,(z) = 0 elsewhere; 
(v)  — Qu(z) = J (\/n® — xi)" for |x| S1/n,  —Qn(z) = 0 elsewhere. 


In examples (i) and (ii) the polynomials P,(z) related to a function f by formula 
(1:1) are frequently called Stieltjes polynomials. In example (iii) we obtain 
trigonometric polynomials P,(xz). In example (iv) the functions P,(z) are 
integral means of f(x) over hyper-cubes. 

We shall wish to consider the following properties which sequences (Q,) 
and (k,) may possess. We let S, denote the interior of the sphere with center 
at the origin and radius e. 

(B) Each Q,(x) is bounded and measurable and non-negative on the domain B, 
and positive on a subset of B of positive measure. 

(C) For every « > 0, lim, knQ,(x) = 0 uniformly on B — 8, . 

(D) Each Q,, is of class C on B. 

(E) If R, denotes a partial derivative of Q, , of total order less than r, then for 
every e > 0, lim, k,R,(x) = 0 untformly on B — &,. 

When the numbers k, are given by formula (A), examples (i), (ii) and (iii) 
possess all these properties, and so does example (v) for n > n, sufficiently great. 
Example (iv) has properties (B) and (C). To verify property (#) for example 
(i), we notice that if R, is a derivative of Q, of order u, then R, is O[n*(1 — 
€)"] uniformly on B — S,. Also, with 6 = n°”, 


1/k, = / f1 — Do ai|"dr= / (1 — 1/n)" dx = bn-™[1 — 1/n)", 
85 S65 
where b is a properly chosen constant, so that k, is O(n”). 
In case the function f is originally defined only on a subset Ao of A, its domain 
of definition may be extended to be the whole of A in a variety of ways. If 
Apo is the interval b; S x; S c;, we may set 


f(%1 , H2, +++ , Lm) = f(bi, %2,---, 2m), 


(1:2) 
Osx <b, BR. SuaSG, +=2,.--,m), 


and so on until the extension is completed. With this extension the properties 
of continuity, of m-tuple absolute continuity, and of satisfying a Lipschitz con- 
dition are all preserved. In place of (1:2) we might use 


(1:3) S(t1 , 2, +++, Xm) = f(2bi — M1, te, --- , Lm). 


This extension preserves also the property of absolute continuity in the sense 
on Tonelli. If f is integrable on a measurable subset Ao of A, we may set 
f = 0 on the complement of Ay. For an extension preserving uniform con- 
tinuity or a Lipschitz condition when Ap is an arbitrary subset of A, the reader 
is referred to McShane’ [10] or Whitney [11, p. 63, footnote]. When f is of 





’ Numbers in brackets refer to the bibliography at the end. 
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class C‘” (in a suitable sense) on a closed set Ao, an extension preserving this 
property has been given by Whitney [11]. A simpler method of extension 
applicable in special cases has been devised by Hestenes, but is not yet published. 

We shall find it convenient throughout the proofs to set f = 0 outside the 
interval A. Then by making the change of variables z; = x; + v; we find that 
for z in A the formula (1:1) may be written 


(1:4) P,(2;f) = kn [ se + v)Q,(v) dv. 


2. Preliminary lemmas. The class of all measurable functions f such that 
|f |’ is integrable on A is denoted as usual by L’, for 1 S p < «. For this 


class set 


1/p 
@:1) stl =| fis ac]” 


We make the convention that L® denotes the class of essentially bounded and 
measurable functions, and for this class denote by || f ||,, the essential upper 
bound of | f(z) | on A. Then if p + g = pq (¢q = © when p = 1) we have 
the Hélder inequality 


@2) || He)o(a) ax 





S |Ifllo-llg lle 


holding when f is in L? and g is in L‘, and the triangle inequality’ 
(2:3) If +g lp = llF lle + lig Il> 


holding when f and g are both in L’. 

We shall have need for two lemmas, which are readily proved. 

Lemma 1. Let f(x) be in L’?, (1 S p < @), and set fi(z) = f(x +h). Then 
\| fx — f ||p approaches zero with h. 

To prove this note that by the triangle inequality (2:3), || f: —J ||p S || fu — 
fille + || fon — fe ||p when f = fi + fe, and || fx —f lp S2\|f|l>- From these 
inequalities it follows that the class E of functions f for which the conclusion 
of the lemma holds is linear and closed in L?. The class E obviously contains 
the characteristic functions of sub-intervals of A. The class of all linear 
combinations of such characteristic functions is dense’ on L’, and hence E = L?. 

Lemma 2. If f(x) isin L?, (1 S p S @), on A, then f(x + v) as a function 
of 2m variables, x in A and v in B, is also in L’.* 

To prove this we note first that if a set E is closed in A, then the set E* of 
points (x, v) of the product space AB for which z + v is in E is likewise closed 





* See, e.g., Hobson [8], vol. I, p. 588. 
* See, e.g., Hobson [8], vol. II, p. 251. 
* Compare Cinquini [6], pp. 59-62. 
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in AB. Consequently if Z is open in A, E* is openin AB. If £ is an interval] 
with measure nH, then E* has measure 


(2:4) pE* = wE- [J a. 


Since every open set may be represented as a sum of non-overlapping closed 
intervals, it follows that (2:4) holds for sets H open in A, and hence for sets 
E closed in A. By applying the criterion that a set E is measurable if and 
only if for every « > 0 there exists a closed set EZ; and an open set EF» such that 
E,CEC E,and u(E, — E,) < ¢, we see that when E is measurable E* is also 
measurable, and (2:4) still holds. Hence f(z + v) is measurable on AB. Let 
w(x) = | f(x) | where | f(x) | S N, fy(xz) = N where | f(z) | > N. Then 


| | tise + oP dae = | | tsCor? doa 


= | OP dv-TT as =f |f0) Pdo-TT a. 


a * be oe 5B . ° e 
Chis with Fubini’s theorem’ implies the desired conclusion. 


3. The general approximation theorems. For the sake of completeness, we 
include the following theorem, whose proof is an immediate extension of that 
given by Landau [1]. 

THEOREM 1. Suppose the sequences (Q,) and (k,) have properties (A), (B), and 
(C). Let Ao be a closed set interior to A, and suppose that f(x) is bounded and 
measurable on A, and continuous at the points of Ao. Then P,(x) converges 
uniformly to f(x) on Ao. 

Let a > 0 be the minimum distance from A» to the boundary of A. For an 
arbitrary 6 > 0 there exists a positive e < a such that 


(3:1) [f(z + v) — f(z) | <6 


whenever x is in Ap and ));v; < &. Now from formula (1:4) and property 
(A) we have 


| Pal) — f(x) | = kn 








[ Ue +o -s@Q.0) wv 


(3:2) Sha [| fle +0) — s(e)| Qa) d 


+ yf (fle + 0)| + |G) 1Qu(0 ae 


The first term on the right is less than 6 for x in Ao by (3:1) and properties 
(A) and (B), and the second term approaches zero with 1 /n uniformly in 2, 
by property (C). 





5 Saks [9], p. 77, Hobson [8], vol. 1, p. 577. 
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TuxorEeM 2. Suppose the sequences (Q,) and (k,) have properties (A), (B), 
and (C). Let f(x) bein L’, (1 S p< ~). Then 
(a) ims | | P,(z) — f(x) |’ dx = 0; 
A 
(b) a subsequence P,,,(x) converges to f(x) almost everywhere on A; 


(c) the integrals / | P(x) |? dx are equi-absolutely continuous; 
E 


lA 


(d) if M is the essential upper bound of | f(x) |, then | P,(x) | S M on A; 


(e) if g(x) isin L'(p +q=pg,1Sps ~), 
a [ | g(x) || Paz) — f(x) | dx = 0. 


By the use of the first equality in (3:2) and the Hilder inequality we have 
Pala) — fa)” = | | Ge + 0) — He)1Qu() aol 
Sk [ [f(x + v) — f@)|? Qn(v) do | [ Q,(v) aw] 
= bey [fle + 0) = $0) Qu(o) a 


From this by Lemma 2 and Fubini’s theorem on interchange of order of integra- 
tion we obtain 


| | P(x) — f(z) |?dax Sk, [ [ I f(a + v) — f(x) |” Qa(v) dx do 
Ph ke [ [ \f(x + v) — f(x) |” dx Q,(v) dv 


+k, [ : [ fle + v) — fle) |? dx Qa(v) do. 


The first term on the right converges to zero with ¢, uniformly with respect to n, 
by Lemma 1 and properties (A) and (B). The second term converges to zero 
with 1/n for each e, by property (C). This proves part (a) of the conclusion. 
Part (b) is a consequence of (a).° Part (c) follows by simple arguments from 
the triangle inequality 


l/p 1/p 1/p 
lf | Pa(e) Pa < If | Paz) — f(z) Par | +|f If(z) P a ; 


part (a), and the absolute continuity of / | P,(x) |" dx and / | f(x) |? dx. 
E E 


* See Hobson [8], vol. II, pp. 239-45. 
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Part (d) is an immediate consequence of (1:4). In case p is finite, part (e) 
follows at once from the Hélder inequality and part (a). In case f is bounded 
and g merely integrable, we may make an indirect proof, supposing that 


lim sup i) | g(x) || P»(x) — f(z) |dzx = B > 0. 
Then for a properly selected subsequence (P,;) we have 
lim [ | g(x) || Pn;(z) — fa) |dx = 8B, 


lim P,,;(z) = f(x) almost everywhere, 
| g(x) ||Pn;(z) — f(x) | S 2M | g(x) | almost everywhere. 


This leads to a contradiction with the theorem on term-by-term integration for 
Lebesgue integrals. 
Lemma 3. Suppose that the sequences (Q,) and (kn) have properties (C) and (D) 


withr 2 1. Let f(x) be absolutely continuous in x; for almost all (a2, --- , 2m), 
and let f and the partial derivative f., be integrable on A. Suppose also that 
S(O, t2,---,2%m) ts integrable. Let Ao be a closed set interior to A. Then 


lim, | Pa(z; fz,) — Pnz,(x;f) | = 0 uniformly in Ao. 
Since the functions Q, are of class C’, differentiation under the integral sign 
is permissible in formula (1:1), and we have 


Pus, (t;f) _ —k» | Fe Quel ai x) dz. 
Hence by Fubini’s theorem and the absolute continuity of f in 2; we have 


P,(2;fz,) — Pas, (x; f) = k / [fe,(2)Qn(z — x) + f(z)Qnz,(z — x)] dz 
(3:3) : 


= hy [= [” W@QaCe — DIEZE dem +++ 
Also 


[tar = [°... [7 W@ HES drm «++ da, 


so that the integrability of f(0, x2, --- , 2m) implies that of f(a: , 22, --- , tm) 
Thus (3:3) with property (C) justifies the desired conclusion. 

Let us recall that a function f(x: --- 2m) of several variables is absolutely con- 
tinuous on A in the sense of Tonelli in case it is continuous in (a1, --+ 2m); 
absolutely continuous in each 2; for almost all values of the remaining variables, 
and each partial derivative f,, is integrable on A. 

THEOREM 3. Suppose the sequences (Qn) and (kn) have properties (A), (B), (C), 
and (D) forr 2 1. Let f(x) be absolutely continuous in the sense of Tonelli, and 
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let the partial derivative f., bein L®, (1 S p< ~). Let Ao be a closed set interior 
to A. Then , 


fa) lim, [| Paes(t) — foc(2) Pde = 0; 


Ao 
(b) a subsequence of (Pnz;) converges to f., almost everywhere on A; 


(c) for E C Ao, the integrals / | Pnz,(x) |? dx are equi-absolutely continuous; 
Ez 


(d) if f satisfies a Lipschitz condition on A, the partial derivatives P,., are uni- 
formly bounded on Ap ; 

(e) if fz, is continuous at the points of Ao, the sequence (Pyz;,) converges to fe, 
uniformly on Ao ; 

(f) of g(x) isin L'(p +q = pg, 1S ps ~&), 


Nin [ 1901) Paocle) — faclz) | de = 0. 
0 
To prove part (a) we use the triangle inequality 


1/p l/p 
I! | Paz:(23f) — fe;(x) P ae s U | Pa(z; fe;) — fe;(z) |? az | 


l/p 
+[f | Prs,(asf) — Pal; fo,) P| | 


The first term on the right approaches zero by Theorem 2 with f replaced by fz, , 
and the second term approaches zero by Lemma 3. To prove (b) we may con- 
sider a sequence of closed intervals A» interior to A but with boundaries ap- 
proaching the boundary of A. The existence of the desired subsequence 
converging almost everywhere on A is then secured by the well-known diagonal 
method. Parts (c) and (f) follow from (a) as in the proof of Theorem 2. Part 
(d) follows from Theorem 2 and Lemma 3, and part (e) from Theorem 1 and 
Lemma 3. 

THrorEM 4. Let the sequences (Qn) and (k,) have properties (A), (B), (C), and 
(D) and (E) forr = m. Suppose that f(x) is absolutely continuous’ in the set of 
variables (4, --- , 2m), and in addition that when one or more of the variables 2; 
is set equal to zero f(x) is absolutely continuous in the remaining set of variables. 
Let t(x) denote a partial derivative of f(x) in which the derivative with respect to 
any one variable x; occurs to at most the first order, and let T,,(x) denote the corre- 
sponding partial derivative of P,(x). Let t(x) be in L’, (1 S p < ~), and let 
Ao be a closed set interior to A. Then 


(a) tim, [ | 7',(x) — t(x) |? dx = 0; 
Ao 
(b) a subsequence of (T';,) converges to t almost everywhere on A; 


(c) for E © Ap the integrals / | T(x) |” dx are equi-absolutely continuous; 
E 





* That is, m-tuply absolutely continuous. 
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(d) if t is bounded on A, the sequence (T',) is uniformly bounded on Aj ; 

(e) if t is continuous at the points of Ao, the sequence (T',) converges to t uni- 
formly on Ao. 

(f) the functions P,(x) are equi-absolutely continuous in the set of variables 
(a1, --+,2%m) on Ao. 

We shall carry through the proof for the case of two independent variables. 
The method of extending the proof to other cases and of obtaining various 
modifications of Theorem 4 will then be apparent. The hypothesis on the 
function f(a; , x2) implies that it may be represented in the form 


fla, 12) = f(0, 0) + [ ” fakin, @ de 


+ [ fe.(0, 42) daz + [ [ Segz, AX2 dx). 


It is easily seen that the hypotheses of Theorem 3 are verified, so that the 
desired properties of P,z, and P,z, follow from that theorem. Now the hypothe- 
ses of Lemma 3 are verified with f replaced by f., , so that 


(3:4) lim | P»(x; fey2,) — Paz,(t; fz.) | = 0 


uniformly on Ag. Also the hypotheses of Lemma 8 are verified with Q, replaced 
by Rn = Qnz,. Now 


Pray(t fos) = ln | fay) Rule — 2) de 
so that by Lemma 3, 
(3:5) lim | Paz,(%}; fe.) — Paz,z,(2;f) | = 0 


uniformly on Ap. Combining (3:4) and (3:5) we have 


lim | P,(2; feys,) ae Prz,2 (2; f) | =0 


uniformly on Ag. From this and Theorems 2 and 1 the conclusions of the 
theorem for (x) = f,,2, then follow as in the proof of Theorem 3. The final 
conclusion (f) follows at once from (c) with p = 1 and T,,(r) = Paz,z - 


4. Iteration of the approximating transformations. In the preceding dis- 
cussion we have considered the linear transformations P,(f) given by formula 
(1:1), which transform certain normed linear spaces F = [f] into subsets of 
themselves. In certain cases these transformations are also equi-continuous, 
that is, there exists a constant M such that 


(4:1) | Pa(f) || = M |IF || 
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for every n and every f in £. They may also have the property that 
(4:2) lim || P.(f) — f || = 0 


for every f in & When the sequence (P,) has the properties (4:1) and (4:2) 
and (K,) is another sequence of transformations, not necessarily linear, but 
having at least property (4:2) it follows that 


(4:3) lim || Po Kif — f || = 0. 
qa, n 


Thus when the transformations K,, are also linear and have property (4:1), the 
composite transformations K,P, have the same properties (4:1) and (4:2) as 
the factors have. In particular the iterates P), , (s = 2, 3,---), have prop- 
erties (4:1) and (4:2). 

We now proceed to discuss some cases to which the argument of the preceding 
paragraph is applicable. 

Case I. The space ¥ consists of all functions f continuous on a closed rec- 
tangular region Ao interior to A, and || f || is the maximum of | f(x) | on Ao. 
We extend the definition of f over the remainder of A as indicated by (1:2). 
Then by properties (A) and (B) and the formula (1:4), || P.(f) || S || f ||, so 
that (4:1) holds true with M = 1. The condition (4:2) holds by Theorem 1. 
Note that before another transformation K,, of the same kind is applied the 
functions P,(x; f) are to be regarded at first as defined only for x in Ao, and 
then the same extension of definition over the remainder of A is to be used as 
was used for f itself. This procedure seems to be necessary in order that the 
preceding arguments may be validly applied. In examples (iv) and (v) it is 
easily seen that this precaution is unnecessary for many applications of the pre- 
ceding result. 

The iteration of the transformations P, seems to be of special interest when 
the P,(f) are the integral means of f, that is, they are defined by the Q, of 
example (iv). For in this case when the function f is of class C“” on the region 
Ao, the transformed function P,,(f) is of class C“*” on an arbitrary closed sub- 
region A; interior to Ao, for n sufficiently large, and-hence F%,(f) is of class 
C°™ on an arbitrary closed subregion A, interior to Ao, for n sufficiently large. 

Case II. The space ¥ consists of all functions f defined and of class L’, 
(1 S p < &), on a fixed measurable subset Ao of A, and 


isi=[f isieae]”. 


In this case we agree to set f = 0 on the complement of Ao. Then by the 
same argument as that used at the beginning of the proof of Theorem 2, we find 


[ Pe) "dx < ka ff | f(x + v) |” dr Q,(v) dv < kf FI? Qn(v) dv = | f|I?, 


so that (4:1) holds, again with M = 1. Theorem 2 yields the property (4:2). 
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If we take Ao to be the whole of A and the transformations P,(f) to be the 
integral means of f, the iterated transformations P;, yield approximating func- 
tions of class C“~” at least on an arbitrary closed subregion A, interior to A, 
for n sufficiently large. 

Case III. The space ¥ consists of all functions f absolutely continuous in 
the sense of Tonelli on a closed interval Ao interior to A, and 


fll = maximum {4 Lf fu | do}. 


We extend the definition of f over A as indicated by (1:3), and let M denote a 
constant sufficiently large to satisfy all the conditions imposed on it. As in 
case I we find | P,(#; f)| S ||f ||, and as in the proof of Lemma 3 we obtain 


[ | Pas, (23S) — Plz; Se,) | dz 


ee = LL [7 eres  aniscs" dem «++ dea| de 


shall sll ff» [Qa —2) [9 + Qule — 2) dam «+ dead 


S2||f ||-a2--- am S M || f||/2. 


By case II we have 


[| Palast) des f falar s MINS 
0 
and combining this inequality with (4:4) (with x, replaced by 2;,), yields 
[| Prosar) |e <M I 
0 


so that (4:1) holds. The condition (4:2) follows from Theorems 1 and 3. We 


note that in this case example (iv) must be ruled out, since it does not satisfy 
the hypotheses of Theorem 3. 


In case the function f satisfies a Lipschitz condition, and L(f) denotes the 
minimum Lipschitz constant for f on the interval Ao , it follows from the proofs 
of Lemma 3 and Theorem 3 that there is a constant M such that 


(4:5) L(P»(f)) S M[L(f) + max | f(z) |]. 


If (K,) is another sequence of transformations having property (4:5) and such 
that | K,(x; f) | < M-max | f(z) |, then 


L(P(Knf) S$ M*[L(f) + 2 max | f(z) |], 


so that the double sequence of functions P,K,f satisfies for each f a uniform 
Lipschitz condition. 
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§. Approximating functions vanishing where f vanishes.” In this section we 
suppose that the function f is continuous on the closed interval Ao interior to A, 
and vanishes on the closed set A1in Ao. We may agree to extend the domain 
of definition of f to be the whole of A by the method indicated in (1:2) or by 
that indicated in (1:3). Consider the sequence of transformations K, defined 


as follows: 
K,(x;f) = f(x) — 1/n where f(x) 2 1/n, 
= f(x) + 1/n where f(z) S —1/n, 
= 0 where —1/n S f(x) S 1/n. 


Let the transformations P, correspond to the Q, of example (v). Then for each 
n there is an integer q, sufficiently great so that the function P,,K,f vanishes 
on a neighborhood of the set A: , and is of class C“” on A. 

In case I of Section 4 we find that the K, satisfy (4:2) so that 


(5:1) lim || PeKaf — f|| = 0, 
(5:2) lim || Po, Kaf —f || = 90. 


In case III, where the function f is supposed to be absolutely continuous in 
the sense of Tonelli, we consider first the case when f 2 0. Then each partial 
derivative f.; vanishes wherever it exists on the set where f vanishes. Thus 
K,z; = f-, almost everywhere on the set where f = 0 and almost everywhere on 
the set where f > 1/n, and | Knz; | S | fz; |, lima Kuz; = f:; almost everywhere. 
Since each function absolutely continuous in the sense of Tonelli is representable 
in a standard way as the difference of two non-negative functions having the 
same property, we find 


lim [ | Kne,(23f) — fe,(x) | de = 0, 


so that (4:2) holds in this case also. Consequently we obtain (5:1) and (5:2) 
for this case, by the results of Section 4. 

When the function f satisfies a Lipschitz condition, the transformed functions 
K,f obviously satisfy the same Lipschitz condition, and hence by the last para- 
graph of Section 4 the functions P,K,f satisfy a uniform Lipschitz condition. 

The results of this section show that the following theorem is valid. 

THEoREM 5. Let f(x) be continuous on a bounded closed set Ao, and vanish 
onasetA,;C Ag. Then there exists a sequence of functions ¢,(x), each of class C ” 
on the whole space and vanishing on a neighborhood of the set A;, such that 
lim, ¢n(x) = f(x) uniformly on Ao. In case the set Ao is an interval and the 


* Cf. Reid [12], p. 859. 
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function f is absolutely continuous in the sense of Tonelli, the sequence (¢,) may 
be required to satisfy also the condition that 


tim [| dno: — foc | dx = 0. 
n Ao 


When the function f satisfies a Lipschitz condition on the set Ao, the functions ¢, 
may be required to satisfy also a uniform Lipschitz condition on Ag. 

Note that the last statement is valid when the set Ao is an arbitrary bounded 
set, since a Lipschitz function may have its domain of definition extended over 
the whole of space without losing that property. 


Tue UNIversity or CHIcaGo, 
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TAUBERIAN CONDITIONS' 


By RautpH PALMER AGNEW 
_ (Received March 30, 1940) 


1. Introduction. It is the object of this paper to introduce several new 
Tauberian classes and to discuss their relations to each other and to various 
Tauberian classes which have been previously considered. 

As the title indicates, our interest lies in Tauberian conditions themselves 
rather than in Tauberian theorems involving them. In order that our discus- 
sion of Tauberian theorems may be as free as possible from complications which 
would draw attention from the Tauberian conditions, we consider in this paper 
only Tauberian theorems for the C, (Cesaro of order 1, or arithmetic mean) 
method of summability. That the fundamental Fourier transform method of 
Wiener’ can be applied to give general Tauberian theorems for the classes 
T, T*, T, , and T’ which we shall define is shown by a paper of H. R. Pitt to 
which we refer in §10. It is probable that the methods of Wiener and Pitt 
apply also to the more general classes 7’ of §11. Each of the classic C; Tau- 
berian theorems to which we refer in §2 is a corollary of Theorem 9.1. 


2. Classic Tauberian theorems. Let u; + we + --- be a series (convergent 
or divergent) of complex terms, and let 
n 1 n 
Sn = Dy Ue} on =- Dd & n = 1,2,--- 
k=1 N k=1 


be respectively the sequence of partial sums and the C; transform of )> up. 
An elementary Abelian theorem states that if s, — s, then o, — s;in other words, 
that C; is a regular method of summability. That co, — o does not imply s, — ¢ 
is illustrated by the series 7 (—1)"*" for which o, — 3. If, however, on — ¢ 
and the terms of >> up, satisfy certain auxiliary Tauberian conditions, then s, > 
s. The following, in which 7 denotes a class of series, is a typical Tauberian 
theorem. 

2.1 If on > o and >> un € T, then 8, — o. 

The essential part of the conclusion is that s, converges; for if it is known 
that 8, converges, then regularity of C; implies that s, — o. 

If T is the class T'¢ of convergent series, then 2.1 is true but trivial since the 
class 7’ is too small to:make 2.1 significant. If 7’ is the union of the class 7c 
and the class 7’p of all divergent (non-convergent) series not summable C;, 
then again 2.1 is obviously true; but again the theorem is trivial since the defini- 





. Presented to the American Mathematical Society, February 24, 1940. 
?\N. Wiener, Tauberian Theorems, Annals of Math., vol. 33 (1932), pp. 1-100. 
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tion of 7 requires that one know whether s, converges before one can decide 
whether 2. Un satisfies the hypothesis of 2.1. If 2.1 is to be useful, there must 
naturally be some series for which it is easier to determine that }~ u, ¢ 7 than 
it is to determine whether 2, u, converges. For application to a given series 
) ® u, belonging to some Tauberian class T, the best theorems are those involving 
T for which it is easiest to show that >> u, € 7. These considerations indicate 
that there probably is no single “best”? Tauberian class which will eventually 
supplant all others. 

A simple Tauberian theorem states that if o, — ¢ and >> u, satisfies the 
condition nu, — 0 of Tauber’ then s, — ¢. A more general Tauberian theorem 
asserts that if ¢, > ¢ and >> u, satisfies the condition n | u, | < K of Hardy‘ 
then s, — o¢. A unilateral Tauberian theorem states that if ¢, — o and 2. Un 
is a real series satisfying the condition nu, <-K (or the condition nu, > —K) 
of Landau’ then s, — ¢. The last theorem was extended by Lukacs’ to cover 
complex series, by the next theorem in which S’ denotes the set of points in 
the complex plane which lie in a sector with vertical angle less than 7. If 
o, — o and nu, €S’, then s, — o. The conditions nu, > 0, n|u,| < K, 
nu, < K, and nu, eS’ (as well as some conditions of Schmidt which we discuss 
in §7) are known as order conditions. The Tauberian condition of the next 
theorem is known as a gap condition.’ If ¢, > o and u, = 0 when n ¥ n, 
Me,-++, Where m < m <--- is a sequence of indices for which lim 
infyp..Mp+i/Np > 1, then s, — o. “Gap Tauberian theorems” such as the 


8 . - isi 
above have been called’ remarkable since there is no order condition on the 
terms u, . 





A. Tauber. Ein Satz aus der Theorie der unendlichen Reihen, Monatshefte fiir Mathe- 
matik und Physik, vol. 8 (1897), pp. 273-277. 

*G. H. Hardy, Theorems relating to the summability and convergence of slowly 
oscillating series, Proceedings of the London Mathematical Society, Series 2, vol. 8 (1909), 
pp. 301-320. 

6. Landau, Uber die Bedeutung einiger neuerer Grenzwertsitze der Herren Hardy 
und Axer, Prace Matematyczno-Fizyezne, vol. 21 (1910), pp. 97-177. See also E. Landau, 
Darstellung und Begrundung einiger neuerer Ergebnisse der Funktionentheorie, 2nd Edi- 
tion, Berlin, 1929. 

°F, Lukacs, Bemerkung zu einem Konvergenzsatze des Herrn Landau, Arch. d. Math. 
u. Phys. (3) 23 (1915), pp. 367-378. For further results and references, see L. S. Bosqnquet 
and M. L. Cartwright, Mathematische Zeitschrift, vol. 37 (1933), pp. 416-423; and N. 
Wiener, ibid., vol. 36 (1933), pp. 787-789. 

7See Hardy and Littlewood, A further note on the converse of Abel’s theorem, Pro- 
ceedings of the London Mathematical Society, Series 2, vol. 25 (1926), pp. 219-236. This 
fundamental paper removes order conditions from the hypotheses of Tauberian theorems 
due to Landau (1907) and Littlewood (1910). For a recent and brief proof and discussion 
of gap theorems, see A. E. Ingham, On the high-indices theorem of Hardy and Littlewood, 
Quarterly Journal of Math., Oxford Series, vol. 8 (1937), pp. 1-7. 

* For a recent example, see N. Levinson, General Gap Tauberian Theorems I, Proceed- 
ings of the London Mathematical Society, Series 2, vol. 44 (1938), pp. 289-306, p. 289. 
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3. The class T. Let K be a positive constant, and let @ be an angle for 
which 0 < 6 < x/2. Corresponding to each real angle y, let S(y) = S(K, @, p) 
denote the “sector with vertical angle < 7’’ consisting of all points z of the 
complex plane representable in the form 


¢=u = Ke™ + pet 


where p = Oand -@ S$ ¢ S @. The sector S(y) has its vertex at the point 
—Ke'’; the angle at the vertex of the sector is 26; the half-line bisecting the 
sector passes through the origin and makes an angle y with the positive real 
axis; and each sector S(¥) may be obtained by rotating the particular sector 
S(0) about the origin. The sectors S(y) are special sectors in that the bisectors 
pass through the origin; however if S’ is any sector with vertical angle < 7, 
then it is easy to determine K, 0, and y so that S’ is a subset of S(K, 0, y). 

DEFINITION 3.1. A series >> un will be said to belong to class T if K, 8, i, 


and ¥1, ¥2, v3, --- exist such that K > 0,0 < 6 < w/2, > 1, and, for each 
sufficiently great index k, 
(3.11) nun € Six) = S(K, 0, vi) kin<™M. 


The Tauberian condition nu, eS’ of §2 implies that there is a fixed sector 
8’ which contains all elements of the sequence nu, ; hence the condition nu, ¢ S’ 
may be termed a fixed-sector condition. On the other hand the Tauberian 
condition ) un € T allows diffierent sectors S to contain different aggregates of 
consecutive elements of the sequence nu, ; hence the condition > u, € T may 
be termed a variable-sector condition. 

A real series }> un satisfies the fixed-sector condition nu, ¢ S’ if and only if 
it satisfies at least one of the two unilateral conditions nu, < K and nu, > —K. 
The condition >> un € 7 does not imply either of these unilateral conditions. 
In fact it is easy to show that a real series >> up is in class T if and only if K > 0 
and \ > 1 exist such that for each sufficiently great k, say k > ko , either 


(3.12) Nun <K ksn< Xk 
or 
(3.13) NU, > —K kin< nk. 


The essential point here is that (3.12) may hold for a certain set of values of 
k > ko while (3.13) holds for the remaining values of k > ko. 


4. Some special series in class T. The next two theorems show that if 
dX u, satisfies one of the Tauberian conditions of §2, then > un ¢ T. 

TuroreM 4.1. If nu, € S’, where S’ is some sector of the complex plane with 
vertical angle < m, then >, une T. 

This is obvious since we can choose a sector S(K, 6, ) such that S’ C S(K, 
6, ¥) and then take y, = y for each k = 1, 2, --- to show that )puneT. It 
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follows that if >> u, satisfies one of the other order conditions of §2, they 
unre T. 
THEOREM 4.2. If ny < ne < --- is a gap sequence such that 
lim inf Np41/N», > 1, 
po 
and u, = 0 when n #~ m,m,---, then > u,eT7. 
Choose \ > 1 such that lim inf n,4:/n, > A, and then choose an index P 
such that 


IV 


2 


For each k 2 n, there is at most one index n in the interval k S n < Xk for 
which u, # 0. We can assume that u, ¥ 0 for an infinite set of n for other- 
wise certainly }> u,e 7. For each k = 1, 2,--- let yy be the amplitude on 
Of Un = pn exp ig, where p, > 0, —7 < ¢, S 7m, and n is the least index such 


that n 2 k and u, # 0. Then, however K > 0 and 0 < 6 < 2/2 are deter- 
mined, the relation 


Np4i > ANp p 


nu, € S(K, 8, x) kiom<m 


holds when & 2 kp and accordingly ZZ une 7. The series satisfying the hy- 
pothesis of Theorem 4.2 are gap series with no condition whatever on u, when 
n has one of the values n, m2, ng, --- 
It is possible to generalize Theorem 4.2 in various ways; for example, the 
following theorem is an immediate consequence of Theorems 4.2 and 6.1. 
THEOREM 4.3. If m < nm < --- is a gap sequence as in Theorem 4.2 and 


n|uUn| << K n #~N,Mm,-::: 
then > <7. 

Each series }> u, satisfying the hypothesis of Theorem 4.3 is representable 
in the form >> (u, + v,) where each one of the series >> u, and >> v, satisfies 
a classic Tauberian condition of §2. That 7 contains important classes of 
series not so representable is illustrated by the fact that 7’ contains each ordinary 
Dirichlet series with non-negative coefficients. 

THEOREM 4.4. Jf a, = O then >. a,n ~¢T' for each complex z. 

Setting u, = a,n “and z = x + ty where x and y are real, we find that nu, = 
A, exp iy, Where A, = na,n* = Oandg, = —ylogn. If 0 < @ < 2/2, and 
\ > 1 is chosen such that | y | log A < @, then 

lon — Ge | Sl|y|logrA < 6 kin<M 

and it follows that }) u,¢7’. In case the argument z in the Dirichlet series 

a,n “is pure imaginary say z = iy, the hypothesis a, = 0 can be relaxed 
(see Theorem 6.2) to give 

TuroreM 4.5. If >> a, € 7, then >> ann” € T for each real y. 

In particular, if a, is real and a, < K (ora, > —K), then yan a,n € T for 
each z on the line Rz = 1. 
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It would be possible to mention other interesting subclasses of 7’; we have 
merely given enough to indicate that Tauberian theorems which are proved 
for the class 7’ have diverse applications. 


5. Characterizations of 7. The following two quite trivial theorems give 
echaracterizations of 7 which are useful in proofs of Tauberian theorems in- 
volving 7’. 

THeorEM 5.1. In order that Dd une T, it is necessary and sufficient that 
K>0,0<0<2/2,a <1 < B, andy, Ye, --- exist such that for each suffi- 
ciently great index k 


(5.11) NUn € S(vx) = S(K, 6, vi.) ak<n< Bk. 


Sufficiency is obvious. To prove necessity, let Zz; un € 1’ and choose K > 0, 
0<6< 7/2, > 1, and vi ‘ ¥2, --- such that for each sufficiently great k 


(5.12) nun € S(K, 8, vi) kon < XM. 
If in (5.12) we replace k by k’ = [k\~“”] and choose a and B such that 

(5.13) i teti<¢pe <x" 

we find that, for k sufficiently great, 

(5.14) nun € S(K, 0, vir) ak <n < Bk. 


Thus (5.11) holds when y; is yy, and the theorem is proved. 

THEOREM 5.2. In order that rv uneT, it is necessary and sufficient that 
K>0,0 < @ < 4/2, u < 1, and wy, pe, --- exist such that for each sufficiently 
great index k 


(5.21) NUn € S(vx.) => S(K, 6, Vx) uk <n =< k. 


Necessity is implied by Theorem 5.1. Proof that (5.21) implies (5.11), and 
hence = u, € 7’, differs very little from our proof that (5.12) implies (5.11). 


6. Properties of T. The definition 3.1 of 7 obviously implies that if : @ Une T 
and uw, = up for all sufficiently great n, then >> u,¢7. In particular if w+ 
Wo + ug + ---€T7, then (u. — A) + uw + uz + --- € 7 for each complex 
number A. 

It is easy to show that if > Ru, € T and }> Ju, eT, then >) u, € 7; that is, 
that = un belongs to class 7’ if the series of real parts and the series of pure 
imaginary parts belong to the class. However simple examples show that 
>» u, € 7 implies neither > Run eT nor >) JuneT. The class 7 is not linear. 
For example, if u, = (—1)" + 1 and uw, = (—1)”" — 1, then pa u, €T and 
Lu eT; but © (uw, + uy) eT fails. 

THeoreM 6.1. If }\ une T and n|vn| < Ki, then D> (un + vn) € 7. 
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Choose K > 0,0 < @ < 7/2,’ > 1, andy, we, --- , and an index ky such 
that when k > ko 


(6.11) nun € S(K, 6, yx) kin<m. 


Elementary computation shows that 


(6.12) n(Un + Un) e S(K + Ky ese 0, 8, x) kin<™, 
and hence that >> (un + vn) € 7’. 
TuEorEM 6.2. If >) une T and Bi, Be, --- is a real sequence such that 
(6.21) lim lim sup max |B, — B,| = 0 
A 1+ ko 80 kn <dk 


then >> Un exp iBn € 7’. 
Obtaining (6.11) as above, we have, when k > ko andk S n < dk 


(6.22) nu, = —Ke* + pane Eten) 


where p, 2 Oand | ¢g, | S$ @< 2/2. Choose 6’ > Osuch that 6” = 60+ 0 < 2/2. 
Let \ be decreased and let ko be increased if necessary so that, when k > ky 
and k < n < Mf, (6.22) holds and also | 8, — Bj | < 6’. If we set Vi = Vi + B 
and g, = ¢n + Bn — B (our notation does not take into account the fact that 
y, as well as p, and ¢, depend on k) we obtain 


(6.23) nune™ = —Kei etn) + pane veten) 


where p, = 0 and |y,| < |¢,| + |B. — B | S 6”. It can be shown that 
Nun exp 1B, € S(K ese 6”, 6, yi) when k > ko and k < n < dk and Theorem 
6.2 is proved. 


TurorEeM 6.3. If >. un,eT and a, a, --- is a bounded sequence of real 
non-negative constants, then >> anun € 7. 
Obtaining (6.22) as above, we find that 


NOnln = —Kayne™ + anpne’* *, 
If K’ is the least upper bound of the non-negative numbers Ka, , then 
NOnUn = — K'e + pace ves) 


where p, = Oand —6 < yn < 0; hence ps Ann € T’. 

Theorem 6.3 can be used to show that if >> u, € 7’, then each series obtained 
by adjoining or removing a finite set of terms at the beginning of the series is 
also in class 7’; if q is a positive integer and one of the two series 


Ute +us+-:-- 
Ug + Ugyi + Ugse + --- 


is in class T, so also is the other 
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7. The class 7*. The developments of this section are largely extensions 
of the ideas of “langsam oszillierend’”’ and “langsam abfallende” sequences 
introduced by Schmidt.” 

It is well known that if >> u, satisfies the Tauberian condition n | u,| < K 


then the simple estimate, 


q q 
a-elt 2 imietet 2 


n=p+l n=p+l 
< K (log g — logp + o,) S K log (8/a) + %, 


in which ak < p <q < Bk and o, — 0 as k — , shows that the partial sums 
s, of >, Un satisfy the Tauberian condition 
(7.01) lim lim sup maximum |s, — s,| = 0. 

a—1l—,B-1+ ko ak<p<a<pk 
Since 

maximum | s, — s,| S maximum | s, — s,| 

ksin<pk ak<p<q<pk 

when a < 1 < 8, the condition (7.01) implies that 


(7.011) lim lim sup maximum |s, — | = 0; 
A1+ ke ksn<rk 
and it is easy to show that (7.011) implies (7.01). 
It is likewise well known that if >> u, is a real series satisfying the unilateral 
condition nu, > —XK, then a similar computation shows that 


(7.02) lim _lim inf minimum (s, — sy) 2 0. 
a-l—,B-1+ k-»c ak<p<q<pk 


If (7.02) holds, then obviously 


(7.021) lim lim inf minimum (s, — s;,) = 0. 

A 1+ 8 k-00 kon<rk 
If (7.021) holds, then corresponding to each ¢ > 0 there exists Ao > 1 such that 
when 1 < A < Xo there exists kyo = ko(A) such that (s, — s,) > — ewhenk > ko 
andkSn< rk. If 1 <\ <r andvA*? <a <1 <6 <)d'’ andak < 
pr <q < pk then, for k sufficiently great, (s, — s,) > —« and (8s, — 8,) > 
—eso that (sg — s,) > — 2; and hence we can obtain (7.02). Thus the Tau- 
berian conditions (7.02) and (7.021) are equivalent. Schmidt (loc. cit., p. 136) 
has called a real series “langsam abfallend” when its partial sums s, have the 
following property: If g = q(p) is a sequence of indices such that q 2 p for each 
p= 1,2,--- and q/p > 1 as p > », then lim infp.(s, — 8p) 2 0. Using 
this definition, it is easy to show that the class of “langsam abfallend” series is 
identical with the class of series satisfying (7.021), and hence also identical 
with the class of series satisfying (7.02). 





r Robert Schmidt, Uber divergente Folgen und lineare Mittelbildungen, Mathematische 
Zeitschrift, vol. 22 (1925), pp. 89-152; p. 132 and p. 136. 
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In a similar manner, it can be shown that if a real series > wu, satisfies the 
Tauberian condition nu, < K, then }> wu, satisfies the equivalent Tauberian 
conditions 


(7.03) lim lim sup maximum (s, — sy) S 0. 
a-l—,B>1+ k-0 ak<p<a<Bk 

and 

(7.031) lim lim sup maximum (s, — s,) S 0. 


A71+ =k kin<rk 


That (7.02) and (7.03) are essentially different conditions is evidenced by the 
fact that the sequence s, = n satisfies (7.02) but not (7.03), and the sequence 
8, = —n satisfies (7.03) but not (7.02). 

Numerous Tauberian theorems involving series satisfying such conditions 
as (7.01) --- (7.031) can be found in a paper of Szasz,”° a monograph of Kara- 
mata," and references given by these authors. A double series analogue of 
(7.02) has been used by Knopp” to obtain Tauberian theorems for double 
series. 

We proceed to show that if }) ua ¢ 7’, then the partial sums of )~ u, satisfy 
a condition more general than (7.01), (7.02), and (7.03). Let }> une 7. Then 


K > 0,0 < @ < r/2, a <1 < Bo, and ——i, —ye, --- exist such that for 
each sufficiently great k 

(7.04) nun = —Ke * + pre’ veten ak <n < Bok 
where pn 2 Oand|¢,| S 6. Ifa <a<1 < B < Bo, then for each real 6, 
(7.05) nue et®) — —Keie 4 pneien tte) 

when ak < n < Bk. Let 

(7.06) A = r/2 — 8. 


If | 5. | < A, then (7.05) and the inequalities p, = 0 and |g, + &| S$ 4+ 
A = r/2 imply that 
(7.07) Rune) > —K ak <n < Bk. 
Hence, when | 6 | S A and ak < p < q < Bk, 

R (Sq — 8,) of Vets) — 3 Runeit® > _K > 1 
7.08) n=p+l n=pt+i 
—K log (8/a) + o 


1° O. Szasz, Converse theorems of summability for Dirichlet’s series, Trans. American 
Math. Soc., 39 (1936), pp. 117-130. 

4 J. Karamata, Sur les theoremes inverses des procedes de sommabilite, Hermann and 
Cie, Paris (1937), 47 pp. 


12K. Knopp, Limitierungs—Umkehrsitze fiir Doppelfolgen, Math. Zeit., vol. 45 (1939), 
pp. 573-589. 
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where 0, denotes a quantity which, when a@ and £ are fixed, converges to 0 


as k — ow, 
Setting 
(7.09) Fi(a, B, k, 6, vn) = mimimum KR (8, * 8») ow 
ak<p<a<pk 
we see that when | &| S A 
(7.10) Fi(a, B, k, dx» Ye) 2 — K log (8/a) + ox. 
Setting 
(7.11) F.(a, B, k) = max min Fy(a, B, k, by . Vi) 
—o<ve<oo [54/54 
we see that 
(7.12) F2(a, B, k) 2 —K log (8/a) + o% ; 
hence 
(7.13) lim inf F2(a, B, k) = —K log (6/a) 
k—00 


and finally, since the left member of (7.13) is monotone increasing as a > 1—, 
8— 1+, 
(7.14) lim lim inf F(a, 8, k) = 0. 
a—1l—,B-1+ k-0 
DeFIniTIoN 7.2. Let T* denote the class of series >) un whose partial sums 
satisfy the condition: a constant A exists such that® 0 < A < 2/2 and 


lim lim inf maximum 
a-~l—,B-1+ k-0 —wo<vEe<oo 


(7.21) zs Ny Bae 
minimum minimum &(s, — s,)e"**"”* = 0. 


}6z)S4 ak<p<a<Bk 

Since (7.21) is merely the result of substituting for F2(a, 8, k) in (7.14), and 
we have shown that (7.14) is implied by the hypothesis that >> uw, ¢ 7’, we have 
proved os 

TuxorEM 7.3. If >> une T, then D> un € 7*. 

The condition (7.21) would be unchanged in meaning if the subscript k were 
removed from ¥, and 6; ; we retain the subscript because it serves to emphasize 
the meaning of (7.21). It is the “variable sector” feature of the class 7’ which 
accounts for the presence of the y’s in (7.21), and it is the presence of the y’s in 
(7.21) which gives the class 7’ its generality. It should of course be observed 
that, in the parade of limits and extrema in (7.12), the operation of taking 
the maximum over — © < y¥, < © is the only one which strives to make the 
inequality sign in (7.12) run the right way. The condition that >) un ¢7* 
(as well as the condition Bi un € 7’) may be stated roughly as follows: in the 





‘8 The left member of (7.21) is a monotone increasing function of A; hence if (7.21) holds 
for some A > 0, we can suppose 0 < A < 2/2. 
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infinite succession of “jumps” in the complex plane from s; to 8 to s; to ... 
jumps which are advanced in the sequence and too close together must not 
be both too large and in directions too nearly opposite. 


8. Properties of 7*. It is easy to show that if >> un satisfies one of the con- 
ditions (7.01), (7.02), and (7.03), then }) u,e7*. That 7* includes series 
>> wn for which (7.01), (7.02), and (7.03) all fail is illustrated by large classes 
of series and in particular by many real gap series. That 7* contains series 
not in T is illustrated by the fact that 7* contains each convergent series while 
T does not. For example, if s, = (—1)"/(n + ) aa then > U» is not in the 
class 7. It is apparent from (7.21) that if uw + we + --- is a series in 7*, then 
(wu; — A) + w+ --- isaseries in 7* for each complex constant A. 

It is possible to characterize T* by conditions similar to but different from 
(7.21). If we replace y. by ¥ + 7 in (7.21) and remove the factor e” = —1, 
we see that >> u, € 7* if and only if A > 0 exists such that 

lim lim sup minimum 
8.01) a—1—,Bp+14+ — k-r00 ~—eSteee 
maximum maximum (s, — s,)e"“***® < 0, 
|5x|S4 ak<p<a<pk 
It thus appears that, while (7.02) and (7.03) are essentially different conditions, 
(7.21) and (8.01) differ only in appearance. 

TurorEM 8.1. In order that >) u,¢T*, it is necessary and sufficient that 
A > 0 exist such that 
lim lim inf maximum 
(8.11) A> 1+ ke —o<vE<0o 

minimum minimum R(s, — s,)e""***® > 0. 
18x 1<A kSn<rk 

Necessity is a consequence of the fact that ifa <1 < B = X, and R,,. isa 
real double sequence, then 
(8.12) minimum R,,, = minimum R,,,. 

kon<k ak< p<q<Bk 

To prove sufficiency, suppose (7.21) fails. Then, when A > 0, a constant 
H = H(A) > 0 exists such that the left member of (7.21) is less than —H. 
Hence a < 1 < fy exist such that, when a and @ are fixed numbers satisfying 
the inequality a < a < 1 < B < fp, the inequality 


(8.13) maximum minimum minimum &(s, — s,)e""***” < —H 

—w<¥E<eo 18%:|S4 ak<p<a<pk 
holds for an infinite set of values of the index k, say ki < ke < ---. Whenk 
has a fixed one of the values kh; , ke, --- there exist indices p, and q such that 
(8.14) ak <p. < a < Bk 
and 
(8.15) maximum minimum R(s,_, — sp,)e"*” < —H. 


—w<y <oo [8|;<4 
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Since (8.14) implies that p,e S qe < Ap, where X = B/a > 1, we can show that 
(8.15) contradicts (8.11) and Theorem 8.1 is proved. 

The condition (8.11) is more convenient than (7.21) when one wishes to show 
that D> ua € 7*; but the condition (7.21) which we have featured in the defini- 
tion of 7* is more convenient than (8.11) when the condition >> u, ¢ 7* is used 
as a part of the hypothesis of a Tauberian theorem. 

Our definition and discussion of 7* naturally applies to series }~ u, whose 
terms are real. Since much of the existing Tauberian theory applies only to 
series with real terms (though in many cases there is an immediate application 
to series of complex terms of which the real and imaginary parts separately 
satisfy the Tauberian conditions involved) it may be of interest to see that 
the condition (7.21) can be thrown into a form somewhat simpler when )> u, 
isreal. Using the definition 7.2 of 7*, we can prove 

THEOREM 8.2. If the terms of > Un are real, then pm un € 1™* if and only if 
(8.21) lim lim sup max minimum (—1)’*(s, — sp) = 0. 

a—l—,B>1l+ k-00 = 7p= 2 akc p<a<pk 

It can also be shown that (8.21) may be replaced in Theorem 8.2 by the 
condition 
(8.22) lim lim sup max minimum (—1)**(s, — s,) = 0. 

A+1+ kos |= kn <dk 
When the conditions (8.21) and (8.22) are made more restrictive by removal of 
the expressions involving j, they reduce to the characterizations (7.02) and 
(7.021) of the “langsam abfallend”’ series of Schmidt. 


9. Tauberian theorems for C, summability. We now prove the two follow- 
ing theorems of which the first is a Tauberian convergence theorem and the 
second is a Tauberian oscillation theorem. 

THEOREM 9.1. Jf o, — o and r une T*, then 8, — o. 

THEOREM 9.2. If0 < A <2/2,€>0,0<a<1<B < ~,k) > 0, and 
ti, Ye, --- are such that when k > ko, 


(9.21) minimum R(s, — s,e"*"*” > —e |&| SA, 
ak<p<a<pbk 

then for each complex number A, 

(9.22) lim sup |s, — A| S [e + C lim sup|o, — A|] cosec A 


where C is the greater of the numbers (1 + a)/(1 — a) and (6 + 1)/(8 — 1). 

The first theorem is obviously a corollary of the second, for if >) uw» « 7* and 
o, — 0, we can set A = o and choose « as near 0 as we please. We give a 
direct proof of Theorem 9.2 which is similar to (and not essentially more difficult 
than) standard proofs of Tauberian theorems for classes smaller than 7*. Let 
k’ = [ak] and k’’ = [8k]. Then for each sufficiently great k the identity 


(9.23) 8 + 8 + +--+ + 8 = (81 + 8 +--+ + See) + (Seg + °° + 8) 
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can be written in the form 
k 


ko; = k’ow + (k — k’)s, — ) Ti (s — 8p), 


p=k'+1 
and division by k — k’ gives 
(9.25) s& = Ut Vi 
where 
_ ko; = k’ox. er 1 . 7 
(9.26) U; — “naw ° V; —— k— &’ ae (s% 8p) 


If we set L = lim sup | ¢, |, then 
lim sup | Us| S lim sup [(& | o¢| +4’ | on |)/(k — &’)] 

< L lim [(k + k’)(k — k’)] = LO + @)/(1 — a) SCL. 
Using (9.21) gives for k > ky and | &| S$ A 


k 
RV, = : > (-d) =e 


k — k’ p=k!l+1 





Hence 


(9.27) lim inf Rs,e*"***” > —e — CL. 


ko 


Starting with the identity obtained by replacing k by k’”’ and k’ by k in (9.23), 
a similar argument shows that 


lim sup Rs.e*"**™ < ¢€ + CL. 


k—>00 
Hence 
lim sup | Rs,e""**” | < € + CL. 
k—0o 
If we set g = arg s, so that s, = | s, | exp ig; , this becomes 


lim sup | s. | | cos (gx + Ye + 5) | Se + CL. 


assume a value which differs from each odd multiple of +/2 by at least A; hence 


As 6, ranges over the interval — A < & S A, the angle (yg, + Ys + 6) must 


lim sup | s,| cos (r/2 — A) Se +CL 


and 
(9.28) lim sup | s, | S [e + C lim sup | a, |] cosee A. 


If s, is replaced by s, — A, then (9.21) still holds and o, is replaced by on — A; 
therefore (9.28) holds when s, and o, are replaced by s, — A and o, — A Te 
spectively, and Theorem 9.2 is proved. 
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10. The Tauberian class of Pitt. In §9 we gave what is perhaps the simplest 
“direct” proof of C, Tauberian theorems for the class 7*. A more indirect 
proof of Ci Tauberian theorems for the class 7* leads naturally to the general 
Tauberian classes of Pitt’* and to generalizations of them. 

Let >> un € 7* and let ¢ > 0. Then there exist 0 < a < 1 < 8, a sequence 


yi, ¥2, °°: , and an index kp such that when k > ko and | 6 | S A 

Ris. — 8,)e > ~—¢ ak <p <q < Bk. 
In particular, when k > ko and | & | S A, 
(10.01) R(sy — eV” > —¢ k <p < Bk 
and 
(10.02) Rs, — spe" > —e¢ ak <p <k. 


Our next step is to let A denote an arbitrary complex number; to replace s, by 
sp - Aand s; by s, — A in (10.01) and (10.02); and then to set g, = arg(s, — A) 
(our notation does not take into account the fact that g, depends on A) so that 
(10.01) and (10.02) become respectively 


(Sp a Ajeet») > —-et+ | 8% —A | COS (ox + We + 5;.) k< p< Bk 


and 
—— (Wetde te) ar es Ye ‘0S 
(sp — Ale >-e-—|s —Alcs( ty +h) ak<p<k. 


As 6, ranges over the interval —A S 6, S A, the angle (g + YW. + 6.) must 
assume a value w, differing in absolute value from each odd multiple of 1/2 
by at least A; if possible, choose 6, such that | 6, | < A and «; lies in the first 
or fourth quadrants, and then set 0, = x + 4, ; otherwise choose 6, such that 
|6.| < A and a lies in the second or third quadrants, and then set 6 = Yx + 
5. + m. These choices of 6, and 6 give 


(10.03) R(s, — A)e™* > —e + |'s, — A| cos (x/2 — A) 


for each p in at least one of the two ranges k < p < Bk and ak < p < k. 

Let, for each p in the interval 0 < p < 1, 7; denote the class of series }> un 
whose partial sums s, satisfy the following condition: corresponding to each 
¢ > 0, there exist numbers a and # such that a < 1 < 8B, asequence 4; , 02, ---, 
and an index ky such that when k > ko, the inequality 


(10.04) Rsye"* > —e+ p| s% | 


holds for each p in at least one of the two ranges ak < p < kandk < p < Bk. 
The argument leading to (10.04) establishes the following theorem in which A 
is the constant involved in the definition of 7*. 





“H.R. Pitt, General Tauberian Theorems, Proc. London Math. Soc. (2) vol. 44 (1938), 
pp. 243-288. We refer to this paper as G.T.T. To compare the classes of G.T.T. with 
ours, it is necessary to associate a sequence 8, with a step function s(x) defined by s(x) = 
8 + Se + +++ + 82) and make an exponential change of variable. 
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Tureorem 10.1. If a series >) un with partial sums 8, is in class T*, then for 
each constant A the series (u, — A) + Ue + --- with partial sums s, — A is in 
the class 1’, for which p = cos (4/2 — A). 

It is clear that if p’ < p, then 7,, > T,. Let T’ denote the union of the 
classes is for0 <p <1. Then T* C 7’. If we modify the definition in- 
volving (10.04) by allowing p to depend on e, we obtain a class P of series which 
is identical” with the class of series satisfying condition T of G.T.T. pp. 
244-245. 

DEFINITION 10.2. Let P denote the class of series »» Un whose partial sums s, 
satisfy the following condition: corresponding to each « > O there exist numbers 
p, a, and B such that p > O and a < 1 < B; a real sequence 6; , 62, --- ; andan 
index ko such that when k > ko the inequality 


(10.21) Rs e"* > —e+ p| % | 


holds for each p in at least one of the two ranges ak < p < kandk < p < Bk. 

Obviously 7’ C P. Hence the relations T Cc T* C T’ and T; C T”’ imply 
that each Tauberian oscillation theorem of G.T.T. which applies to series in P 
applies a fortiori to series in the classes 7, T*, T, , and 7”. 


11. The classes 7’; and 7”. We now define still more general Tauberian 
classes by removing the requirement that (10.21) hold for each p in a range 
having k at one of the extremes. However we maintain (by restricting the 
sequences a, and £,) the requirement that (10.21) shall hold for each p in a 
range which is neither too short nor too remote from k. It will be clear that 
PCT" C Tz foreach E > 0. 

DeFINITION 11.1. Corresponding to each E > 0, let T's denote the class of 
series ), Un whose partial sums s, satisfy the following condition: there exist a 
constant p > 0; sequences a), and B, such thatO < a. < By, ka, > ©, and 


(11.11) lim sup (Bi + ax) / (Be — ax) =Fe< ©; 
a real sequence 0, ; and an index ko such that when k > ko 
(11.12) Rsye"* > —E + p| % | 


Let T” denote the intersection of the classes Tz for E > 0. 
TueoreM 11.2. If >< u,¢ Tz then 


(11.21) lim sup | s;| < [EZ + F lim sup | on {Jo 


n->oo 


where 8s, and a, are respectively the partial sums and the C, transform of 2, Un; 
and the constants F and p > 0 are any set for which (11.11) and (11.12) hold. 





16 It is easy to see that, apart from the difference in scale of the independent variable, 
the definitions differ only in appearance. 
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Using the notation of the definition of Tx, let k’ = [kay] and k” = [kp,J. 
Then the identity 


(11.22) (8: +--+ + See) — (81 + +++ + Ser) = Sieg + +++ + Ser 
leads, for k sufficiently great, to 


ke? 


Rk" aK — kone =R Dy spe 


p=k/+1 
(11.23) kere 
> DY (-E+ols|) = &” —k)(-E + pls) 
p=k!l+1 
so that 
(11.24) pls | SE + | kop — koe |/(k"’ — k’) 
and our result follows easily by use of the crude inequality | k’’o.,, — k’o. | S 


K" low | +B | on |. 

This proof of Theorem 11.2 is so simple and straightforward as to be almost 
trivial. Indeed it may seem that the general class 7’; , at which we arrived 
after several successive generalizations of the original Tauberian class of series 
for which nu, — 0, is one designed especially to make possible a simple and 
transparent proof. It is a significant fact that direct proofs of C, Tauberian 
theorems for the class of series for which nu, — 0 are simple and straight- 
forward; that such proofs for intermediate classes such as the class of series for 
which n | un | < K or the classes 7 or 7* are more devious and complicated; 
and that finally such proofs for the larger classes 7’, are again simple and 
straightforward. 

Several corollaries of Theorem 11.2 are easily obtained. In the first place, 
if }> u, € 7’; and o, is bounded, then s, must be bounded. If it is true that not 
only >> un € 7's but also the series (u, — A) + we + us + --- is in class 4 
for each A (and, by Theorem 10.1, in particular if = un € T*), then we can 
replace s, and o, in (11.21) by 8, we and o, — A respectively to obtain 


(11.25) lim sup |s, — A| S [E + F lim sup |o, — A |]p. 


Ifo, — o and (11.25) holds, then we can set A = o to obtain 
(11.26) lim sup | s —o| S E/p. 
k—e 


If it is also true that }° u, « 7” and that E and p can be chosen such that E/p 
is arbitrarily near 0 (and in particular if }> u, «7, for some p > 0), then 
(11.26) implies that > u, converges to o. 


12. Conclusion. In conclusion, we use the preceding study of Tauberian 
conditions as a basis for indication that the general problem of obtaining 
Tauberian convergence and oscillation theorems may be divided naturally into 
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four categories. Order among these categories is not significant, since each 
category is necessary for a complete theory. 

I. The first problem is that of discovery of significant classes S of series 
>> un of such a character that one may decide by inspection of the terms of 
>> un (and preferably with no knowledge whatever concerning properties of the 
sequence s, of partial sums of }> un) whether >) wn « S. 

II. The second problem is that of devising criteria to assist in showing that 
by Une. 

III. The third problem is that of showing that S is a subclass of a general 
class G of series for which Tauberian oscillation theorems can be proved by 
straightforward methods. To provide for Tauberian convergence theorems, it 
is also desirable to prove that if >> un ¢ S, then for each complex A the series 
(uw, — A) + uw + us + --- is in class G. 

IV. The fourth problem is that of establishing Tauberian theorems for the 
classes G. 

Problems of type IV are the ones which depend essentially upon the particular 
method of summability used. The attention of this paper has been centered 
almost exclusively on problems of the first three types. 


CorRNELL UNIVERSITY, 
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ANNALS OF MATHEMATICS 
Vol. 42, No. 1, January, 1941 


uch 
les 
of ON DIVERGENCE PROPERTIES OF THE LAGRANGE 
the INTERPOLATION PARABOLAS 
By P. Erpds 

mt (Received November 13, 1939) 
ral Throughout the present paper, —1 < ai” < AT ee 2 aX z\” < 1 denote the 
by roots of the n-th Tchebicheff polynomial 7’,(z), and unless otherwise stated it 
, it is understood that the fundamental points of the Lagrange interpolation are 
‘les the z{" It. is. well. known that? there exists a continuous function whose 

interpolation parabolas diverge everywhere in ( = 1, +1). In the present paper 
ihe we prove that for 2) = cos Pn, p = q = | (mod 2), (p:, g) = 1 there exists a con- 
lar tinuous function f(x) such that L,f(x0) > ~ *. Turn and I’ proved that this does 
red not hold for any other point. “In this direction Marcinkievicz’ proved that if 


the fundamental points.are the roots of U,(x) =-T4:(z) then for every contin- 
uous function f(x) and every point 2 there exists a sequence of integers n; < 
ng < +++ such that Ln, (f(x0)) — f(ao). We remark that in the case of the Fourier 
series it is well known that there always exists a subsequence of the partial 
sums converging to.f(x9).-. This fact. may be of interest because there is often an 
analogous behaviour of the~Lagrange interpolation parabola and the Fourier 





series. 
First we prove some Jemmas. 
Lemma 1. 
af” — 2 > & form2=n 
‘ 7 mi’ = ° 
Proor. Write 
2% —1 
as” = cos of”, d; = T. 
2m 
Then we have 
(m) (n) (m) oc. " 1 
|ai”” — 2} > |a" — 33" | sin — > — — > — aed. 
‘ i 1> [8 i | on” Sndma” m 


‘For the employed notations see P. Erdés and P. Turan, Annals of Math., Vol. 38 (1937), 
p. 142-155. If there is no danger of confusion we will omit the upper index n. 

*G. Griinwald, Annals of Math., Vol. 37 (1936), p. 908-918. 

*L(f(x)) denotes the Lagrange interpolation parabola of f(z). 

Rs This result was stated in the Annals of Math., Vol. 38 (1937), p. 155 but there was a 
misprint. 

* Acta Litt ae Scient. Szeged, Tom. 8, p. 127-130. 
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LemMa 2. Put 2 = cos 7 p = q = 1 (mod 2); then constants c, and cz exist 
such that 


_ (to — x{"| >= | Tn(x0) | > ce. 


| T(x) | = cos (Px) = cos ( _ i) > ce. 


Put a§” < a < x$f}; then 
: ‘: 24-1 
min jz — 21° | > © min (sin 21, 
jalaiicestin 2nq 2n 


LEMMA 3. 


DL’ | li” (ao) | < (log n) 
where » ing indicates that the summation is extended only over the x§” satisfying 


(rn) _ 1 
| a ~} > (log n)*” 
PROOF. 


(n) Ws T (20) (log n)* 
oe T.(xx) (to — 2x) n 


since | 7',(%0) | S 1 and T,(a,) = ae 2) = n, which proves the Iemma. 





(n) (n) 


Without loss of generality we may assume that % > 0. Let 2zj” < 2% < Zj41- 


Now we prove 


Lemma 4. Suppose 0 < a{” <2!” (ic, ~ tae i); then 


Proor. We have 








Pa) ee et A ee 
. n(Xe)(%o — Xx) | n(x — Le) n(2j41 — Tr) 
by Lemma 2. Now zj41- 2, < (7 +1-—- k)= es(j — Kk) , which proves the 
n n 


Lemma. 
Lemma 5. 


. ] 
Dee) | >a. 
(2k—1,n)=1 log log n 
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Proor. By Lemma 4 we have 


Deo) | > DO" [UPd | > es DO” 

(2k—1,n)=1 j ois k 
where the two dashes indicate that the summation is extended only over those 
n 
2 
on of the a” between 0. and z$”, thus 


Lj RS 7 > 5 Raage 1 


7—k 


k for which (2k — 1,n) = land= <k <j. Itis clear’ that there are at least 


where the three dashes indicate that the summation is extended only over those 
k which satisfy (2k — 1, n) = landj — k < eqn. 

Denote by v(n) the number of different odd prime factors of n. It is well 
log n 
log log n° 
the prime number theorem, but can also be obtained in an elementary way.) 
The number of integers k satisfying 7 — x < k < j, (2k — 1, n) = 1 equals by 

the sieve of Eratosthenes 


:->|2] + DB |=] - es, = «(i bis 1) — ar 


known that v(n) < ¢g (This result is an immediate consequence of 


pin LP pain LDQ pin Pp 
7 log n/log logn z 2a 
— 9¢8 log n/log log 
> aries > cw a for z - —/n, (p odd) 


since it is well known that [] (1 = ‘) >t * Thee 
pin p/ log log n 


> Drag 1 C10 1 , log n 


j—k ‘f log log n oe Se r aps log log n q.e.d. 





THEorREM 1. There exists a continuous function f(x) such that L,(f(20)) > 
Proor. Write : 


_ = Jfa(z) 
fz) = Zz VJ/log n° 








63 (n) (n) T 
Le. |2-4,—-2," |S -,7 =1,2,---. 
n 


z U 
. H denotes the number of the k’s in the interval j — z < k < j for which 2k — 1 is 


divisible by p. It is clear that : differs from = by less than 1. 
Pp 


*E. Landau, Uber den Verlauf der zahlentheoretischen Function. Archiv der Math. und 
Phys., Ser. 3, Vol. 5, (1903), p. 86-91. 
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fn(z) is defined as follows: 


frlcx”) = signum If” (2) for (2k — 1,n) = 1, 
(n) 1 
i % & 2" = 0, 


1 1 
in the intervals (0 ai” 4 +) and (20 a” — *): fn(x) is linear and else- 
where f,,(z) = 0. 


First we show that f(x) is continuous. It suffices to show that 


S n(x) 
n=no Vlog n 


is uniformly convergent, i.e. that 


fr(x) 


<.. 
n>n(e) Vlog n 


If for a certain y, fn(y) and fm(y), m > n are both different from 0, we have for 
a certain k; and ke 


n 1 m 1 P 
ler —y| < se lair — yl < sam 


(n) (m) 2 
Lk — Lbs | < oa: 


But by Lemma 1 


(n) (m) 1 
| te ~ ty | > = 


hence 2m’ > 2” i.e. m > n? for n > 3. Thus 


n(x) 1 
—— < —— js 
n>n(e) /log n r>r(e) Vlog >" = 


_ et fle) oii ia 
g(x) 2» Vlog r’ ¢o(x) ” oe ey 


L(f(ta)) = Lnler(e)) + Ly ( £2) + Lnler(2)). 
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First we show that L,(g2(z)) = 0. It will evidently suffice to show that for 
every k, g(x”) = 0 or that for r > 2, fick”) = 0. If for a certain 
r > n,f(ts”) # Owe have for a certain 1 


1 
| af” — xf” | < oF 


which does not hold for by Lemma 1 for 2” > r’. 
eid Next we estimate L,(g;(z)). - If for a certain x”, f(x”) # 0 then for a 
certain | 


| af” — xf” | 1 cae 


which by Lemma 1 means that 
2 <n® or r<2loglognforn > m. 
Thus if for a certain 2”, :(a§”) * 0 then by Lemma 2 


1 Cc 1 1 
|e a | i ours |a wo ya - r 22" > (log n)t for r >No 


r 


Thus by Lemma 3 
L(gr(ao)) < cre } | 1" (xo) | < ex(log n)! 


|z%—Z0 | > Clog n)-t 


> for 


Now by Lemma 5 


La(fa(xo)) = a | Us” (ao) | > ce 


since for (2k — 1, n) # 1f,(%) = 0. Thus finally 


log n 
log log n 


(log n)? _ ; 
Li(f(x0)) > ce log log n Cz (log n)* > ~. 


Similarly we could prove that a continuous f(x) exists such that L,(f(20)) 
converges to any given value. 


THrorEM 2. If x ¥ cos : x, Pp = q = 1 (mod 2) then there exists for every 


continuous f(x) a sequence of integers mn, <m < --- such that Ln,;(f(%0)) — f(20). 
Proor. First we prove that there exists a s@quence if integers m < m < --- 


such that | 7',,(a0) | < =. We need the following 
Lemma 6. If % ¥ a p = q = 1 (mod 2), then the inequality 


2r —1 
2n 


C14 
2 
Ny 











Xo — 


has an infinite number of solutions. 
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2r—1 
2n, 
Hence we may suppose that 2 is irrational. It is well known that the equa- 
~ i < 7, has an infinite number of solutions. If infinitely many of 
the b’s are even the Lemma is proved, if not consider the least positive solution of 


Proor. If 2 is rational it is of the form , thus the Lemma is trivia]. 


tion | Xo 


2ad — bf = 1. 


Obviously f = 1 (mod 2) and d < b thus 


which proves the Lemma. 


If nm — C4 


< — we have 
2n, Nh - 


wr Cu Ciz 
T n, (0) < cos t < “ . 


2r — 


Consider now a sequence of integers m..< mz. < --- with E - 


C14 


-;: We are going to prove that Ln,(f(%0)) — f(a). 


Nn 
For k ¥ r we have 
T n,(%o) c 


a 13 
| exo) | = pam — %) aa 





X |telee)| < 8 + —_ = o()," 


kr wt — Xo | 


hence from 


> L(x) =1 


k=1 





9 We have 


> 1 " y’ 1 


itpir TT 14-29 SClogn)-1 | Ze — Zo! 


1 
+ Fy < nlogn + cn log n = o(n?). 
|z%—Z9| >(logn)—! | te — 2| 


(The dash indicates that k = r is omitted.) 








Vial, 
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it follows that 


1,(2o) =l1- o(1). 
Thus 
Lny(f(0)) = f(ae)L,(a0o) + 2 S(e)le(ae) = (f(x) + [1 — o(1)] + (1) > f(x), 


which proves Theorem 2. 
On the other hand we can prove that for every z in (—1, +1) there exists a 
continuous f(z) such that 


Lo Lm(f(x0)) 


msn 


lim = We 


n= 
The proof is very similar to that of Theorem 1. 


Princeton, N. J. 
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ADDITIVE SET FUNCTIONS AND VECTOR LATTICES 


By S. Bocuner AND R. S. PHILLIPS 
(Received March 29, 1940) 


This paper arose out of an attempt to extend known results in the theory of 
countably additive set functions to the finitely additive case. The basic tool 
used in this investigation has been the vector lattice. We have found this 
approach helpful to the understanding of both additive set functions and vector 
lattices. ; ‘ae 

An important link between point functions and additive set functions is the 
theorem of Lebesgue that every set function which is absolutely continuous is 
the integral of a point function. Radon and Nikodym extended the theorem 
from ordinary Lebesgue measure to countably additive measure on general sets. 
If the measure is bounded this theorem states that in the Banach space of all 
absolutely continuous functions, step-functions are dense in the norm. It was 
shown recently by one of the authors (2) that the latter theorem remains valid 
for finitely additive Jordan measure in general; however in the course of the 
proof the more general case was reduced to the previous theorem of Lebesgue- 
Nikodym. In section II of the present note a new proof will be given. It will 
not presuppose any essential facts from the Lebesgue theory proper, being based 
on simple but important facts from the theory of vector lattices. These pre- 
requisites are assembled in section I. 

In section III we discuss some analogies between vector lattices and set func- 
tions; here again the approach rather than the result is new. The positive ele- 
ments are shown to be additive set functions on the Boolean algebra of normal 
subspaces. As a consequence finitely additive set functions become completely 
additive on this algebra. It is significant to observe that this extension of the 
original algebra can also be expressed in terms of the stochastic distance. We 
further show that a vector lattice may be embedded in a second vector lattice 
having the same Boolean algebra of normal subspaces and possessing a unit 
element. Finally we consider briefly the question of a generalized base. 


I. ProJEcTIONS IN VecTOR LATTICES 


A space L will satisfy the following five postulates: 

I: L is a linear space with real scalars, and a relation x > 0 is defined on L. 
Il: Ifz >Oandy >0,thenz+y> 0. 

Ill: If z > 0 and @ is a scalar, then a > 0 implies ax > 0 and conversely. 
IV: Relative to the given order relation (r > y means x — y > 0), Lisa 
lattice. 

We will write: xv y for sup (a, y), xay for inf (z, y); x* for xv0, x for 
—2xv0, and | z | forz* + x; the least upper bound (if it exists) of a set E will be 
denoted by Vex and the greatest lower bound by Azz. 
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\: If a set E is bounded above (below) then Ver(Azx) exists. 

Proof of the following theorems can be found in a paper by Freudenthal (8): 
ren —2.;|2|=— eye = z’v2;2—-a+-2and2—ar (a > 0) are 
lattice automorphisms; z — —2 is a lattice anti-automorphism;zv y+ ray = 
rtyj;zv(yaz) = (cvy) A (vz); TA(yvz) = (rAy) Vv (rAZ2); a > O 
implies aV pr = Vear, Ve,ravy = VexvVVey, Ve, rcay = VexAVry, Ve,r(z + y) = 
V,z + Vzy (if the least upper bounds on the right side exist); and dual state- 
ments for the greatest lower bounds. 

x will be said to be disjoint from y if |z|a|y|=0. Given a subset Y of L, 
Y' will be the set of all x e L disjoint from every ye Y. Iterating the process, 
we define Y” to be (Y’)’ and similarly for Y’” ete. 

As Y” contains all elements of Z disjoint from every element of Y’, Y” > Y. 
IfYDZ,then Y’C Z’. Inparticular Y’’C Y’. It follows that Y’ = Y’" = 
y°=.-.-and Y” = Y” = Y’ =..., The sets Y and Y’ have at most the 
null vector in common. , 

A subset E of L which contains along with x an element v such that 2|z| < v 
will be called a D-set. 

A subspace of Z satisfying postulates I, II, III, IV, and V for sets bounded 
above (below) in L is called normal’ if it contains with x all y such that 
0<|y| sa). By a direct decomposition of L is meant a choice of disjoint 
complementary normal subspaces— that is, of normal subspaces S and 7 such 
that Sa 7 = 0 (null vector) and L is the direct sum of S and 7". © 

If c = 0 and if Y is a D-set, then we define 


ty = VuyeyD AY 


to be the projection of x on Y. This is a fundamental.notion for this paper. 
If Y is a normal subspace, then zy belongs to Y as it is*the least upper bound 
of a set of elements in Y. If x 2 O belongs to Y, then zy = Vyeyray = 2. 
Further if c = 0, then x — zy 2 0. 

The purpose of this section will be to show that Y’ and Y” define a direct 
decomposition of L and to study the case in which Y contains but a single 
element. 

TarorEM 1. If Y is any subset of L, then Y’ is a normal subspace. 

Suppose y is a fixed element of Y and the z; belong to Y’. Y’ is linear because 
‘t/a|y| = 0 implies |ar|aly| = 0 and 0 S |m + mlaly| &S 
(| 2r1|v| 2a |)aly| = (| 2a |aly|)v (| 2elaly|)=0. As ¥’C L, Iland 
III are valid in Y’. To establish IV and V let E be a subset of Y’ bounded 
above by ze L. Then by a repeated application of (uvv + wav = u + »), 
we obtain 


0S|Verlaly|<(Vee*)aly| = Vert +ly| — Ver? viy| 
= Vi(z*vily|t+2°aly|) — Ver viy| 
<Ve(x* v | y [yp'- Ve(a* aly |) — Vala v| y |) =0. 


‘Garrett Birkhoff (1) calls such a#4t4pace a complete normal subspace; F. Riesz (6) 
denotes it by famille compléte. : 
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The dual statement for boundedness from below can be similarly shown. 
Finally if | w| < |z|,thenO0 S|wlaly| S|zlaly| =0. 

Lemma 1. If Y is a D-set and if x 2 OeL, thenz — rye Y’. 

Let y 2 0 belong to Y. Then 


0S (x —a2y)Ay S (@ — Ay) ATAY 


lA 


(1 — Ly) ALy 


—2y + 2A Qty = —Ly + FAW yyr(ZAyY) 
= —2y + Vyer(Za2ray) 
= —Zty+ zy = 0. 

As an obvious consequence of the lemma we have 

(x — ty)y = 0 


and this justifies the name “projection.” 

THEOREM 2. If Y is a normal subspace of L, then Y = Y”. 

We can restrict ourselves to positive elements since if z « Y, then x* ¢ Y and 
xz eY. Ifx = Oc Y”, then by the lemma the element z — zy belongs to Y’. 
However, it also belongs to Y” since 0 S$ x — zy S xand Y” isnormal. Our 
element must therefore be the null element, and hence z = zy. But x was an 
arbitrary element of Y’’, and zy belongs to Y. 

THEorEeM 3. If Y is a normal subspace of L, then Y and Y’ define a direct 
decomposition of L, namely 


x = (ay — ty) + (ay — ay). 


Any decomposition x = 2; + 22 where 2; e€ Y and 22 € Y’ is certainly unique 
because Y and Y’ are linear and have only the null vector in common. It is 
therefore sufficient to show that for x = 0, = zy + zy. By lemma l, 
z—axzyeY’. Hence ry = (x — zy)y' = x — zy. Since Y = Y”, we like- 
wise have zy = x — ry. Thereforex = zy + 2y-. 

Frédéric Riesz (6) has demonstrated theorems 1, 2, 3 for the special case 
where L is the space of linear functionals on a semi-vector lattice. 

If Y consists of a single element y, then Y’”’ will be called a principal normal 
subspace and be designated by P(y). 

From the definition, P(y) = (y)’’, it follows that x P(y) if and only if 
|z|aly| = 0 implies |z|a|z| = 0. Now xeP(y) and |z|aly| = 9 
implies |z|=|xz|a|xz|=0. Henceif zx ¢ P(y) andif x ¥0,then|xz|a|y|> 
0. In Freudenthal’s terminology (3), | y| is a unit element of P(y). 

THEOREM 4. A necessary and sufficient condition that x belong to P(y) 1s that 
z= V,2*an|y| — Var an|y|. 

If x is of this form, x clearly belongs to P(y). To prove the converse we need 
only consider x = O«P(y). As the set {n|y|} is a D-set, it follows from 





ler 


bo 
as 
ti\ 
fu 


an 


ADDITIVE SET FUNCTIONS AND VECTOR LATTICES 319 


lemma 1 that c — V,xan|y| belongs to P(y)’. Being less than 2, it also 
belongs to P(y), and is therefore the null vector. 

Theorem 4 shows that elements of P(y) are approximable by elements 
bounded in |y|. The element y being fixed, boundedness in | y | is the same 
as a Lipschitz condition in the case of point functions. If L is the space of addi- 
tive set functions on a generalized Boolean algebra, P(y) is the class of all 
functions absolutely. continuous with respect to the set function y. Theorem 
3 then gives a decomposition of the function into an absolutely continuous part 


and a singular part. 
II. ABsoLUTELY ContiINuOoUs Set FuNcTIONS 


We now consider a finitely additive Boolean algebra of subsets E of a given 
set G, and on it a (finitely additive) Jordan volume which will be denoted by 

EF) or|E|. We assume that |G| = 

We consider the Banach space V; of all set functions F(#) of bounded varia- 
tion relative to finite partitions 6 = (EZ,). We define F > 0 if F(Z) 2 0 for 
all E and F(Z) > 0 for some EZ. With this definition, V; has all properties 
I-V of a space L. The properties I-III are trivial. Let [F.], a is an index, 
be a set of elements which are bounded from above by an element .G. In order 
to show the existence of supaF'. we take for any set HE any integer n, any set of 
indices a = a, --- , & (which need not be different), and any partition of E 
into disjoint sets ZH, , --- , EZ, , and we put 


(1) F(E) - SUPn,a,.By (Fa, (F:) ils F.,(En))- 


Since F,,(Z,) S G(E,) and G(E) is additive, we obtain F(Z) S G(E£) and 
therefore F(E) is finite. It is not hard to see that F(£) is additive. Also 
putting a, = --- = a, = a@ we obtain F(Z) = F.(E). Lastly if H(Z) = 
F.(E) for all a, then H(Z) = H(K;) + --- + H(E,) 2 Fa,(Fi) +--- + 
F.,(E,) and therefore H(Z) = F(E). The inf. can be obtained in a dual 
fashion. Thus properties IV and V are also satisfied. 

An element F of V, is called absolutely continuous if for any « > 0 there 
exists an 7 > 0 such that | E| < 7 implies | F(Z) | < «. The totality of such 
elements F will be denoted by AC. Obviously v(#) «AC. An element G of 
V, is called singular if to any « > 0 there exists a set E, for which | FE, | < ¢such 
that for any EZ, |G(E — E.)| < «. The totality of all such elements G will 
be denoted by S. It is easily seen that AC and S are each linear spaces with 
properties I—IV. 

Now, if F e AC and Ge S and F = 0 and G = 0, then 


inf (F, G) = infers erne(F(E’) + G(E”)) = 
Therefore 


Sa AC = 0 (null function) 
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and 
(2) S'’ D> AC. 


On the other hand if G = 0, Ge Vi, then 
inf (G, v) = inf ge4.n/r—n(G(E’) + v(E’’)) = 0 
if and only if GeS. Therefore S = (v)’ and (v)” = S’. Thus by (2) we obtain 


(3) (v)” D AC. 
Our next step is to show that 

(4) (v)” Cc AC 
and hence 


(v)” = AC = S’, 
Now (4) asserts that any element F 2 0 for which 
(5) F = lim,.. inf(F, nv) 


is absolutely continuous (theorem 4). Each element F,, = inf (F, nv) belongs 
to AC since 0 S F,(Z) S$ n|E|. Furthermore F,(E£) converges toward F(E) 
uniformly in £ since 0 S F(#£) — F,(£) S F(G) — F,(G), and therefore F(E) 
also belongs to AC. By theorem 1, AC and S are normal subspaces. 

It is now very easy to prove the following theorem: 

THEOREM 5. Step functions are dense in AC. 

Since for F = 0, 


| F — Fa lh = FG) — FAG) 


our argument shows that “bounded” functions are dense in AC inthe norm of 
Vi, and so it suffices to prove that step functions are dense in the manifold of 
bounded functions. Introducing the space V2 (see (2)), since the norm of a 
bounded element is larger in V2 than it is in V; it is sufficient to prove the 
Lemma 2. Step functions are dense in V2. 
If f(x) is a bounded integrable point function and if for some E we put 


1 
e= EI [ t@ dv 
then obviously 
[\s@ ~ eld = | is@) P dv — jal] El. 


We take a partition 6 = (E,), put E = E, and sum over v. This will give 
(6) IF — fell’ = IFIP — We IP. 
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Now let F be an arbitrary element of V2, 6 a fixed partition, and 6’ any parti- 
tion > 6. Since Fs (see (2)) is a step function, and F; = (F%3)s we obtain 


from (6) 
| rs - Fa) ||" = || Fe II? 4 \| Fe Il’ 


Taking the limit with respect to's’ we obtain by the definition of the norm 
(|. = Fs |P= | FIP — (Poll. 


Thus calle 
lim, || F — F;|| = 


and therefore F is the limit in the norm of step functions F;. 

A Banach space X will be said to possess a generalized base if there exists a 
directed set of linear tratisformations U,(z) on X to a finite dimensional sub- 
space of X such that (1) litasU;(x) = x for all x « X, and (2) || Us || < M for 
all 6. 

If X is the space AC/and!é = (E,) is a finite partition such that v(£,) ~ 0 
for any v, then define 


F(E,) 
WE,) 


It is clear from the proof of theorem 5 that lim;U;(F) = F in the norm topology. 
Further || Us(F) ||: S {| F ||:.. Therefore AC possess:a generalized base. 


UF) = ds v(E- BE). 





III. THe BooLteaAN ALGEBRA OF NORMAL SUBSPACES 


The class B of all normal subspaces of a vector lattice L with properties 
I-V is a Boolean algebra. This-known fact and-further information concerning 
B will be derived in the present section from properties of projections. 

In B the element L is the 1 element, the normal subspace consisting of only 
the null vector is the 0 element, and for a set. E = [A] of B, AgA-is the set 
intersection and V;A is the smallest element of B containing all elements A of 
E. Also if A is an arbitrary element of B, then the derived element A’, as 
defined in section I, is the complement of A in B in the sense that A a A’ = 0, 
Avy A’ = 1, and A” = A.° It remains only to prove that B satisfies the dis- 
tributive law. This will be done in theorem 8. 

In the next three theorems, a fixed*¢lass of elements [A.] of B will be con- 
sidered, and as an abbreviation we will denote a P-the normal subspace’ V.Aa , 
and by R the set sum of all elements of the Az. Obviously P D R,-and there- 
fore P’ C R’ and P’ = P D> R”. However since P is the smallest. normal 
subspace containing R, R’ DP. Therefore P-=-R” and P’ = R’. 

THEorEM 6. If [A] is any class of elements of B, then ~ 


(7) (V.A m = NaAa’ 
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and 
(8) (AcA a ‘= VA Gn 


The relation P’ = R’ implies that the following statements are equivalent: 
az € (VaAa)’;|2|A| aa | = 0 for all elements of R;x € Aq for all a;andzeA,A\,. 
This proves (7). (8) follows from (7) by taking the complement on both sides 
and replacing Aa by a. 

TuEeorEM 7. [If [Aq] is any class of elements of B and if x 2 0, then 


(9) Wa, = Valag 
and 
(10) Tra, = NaXa,: 


Now R is clearly a D-set. Therefore by lemma 1, z — zz € R’ = P’. Hence 
Llp =2—2p>x—2e. But PDR implies zp = Zz, so that rp = za. 
We have 


Br = Va,a,t Ada = ValVa,0 Ada] = Vara,’ 

This proves (9). (10) follows from (9) in the following way: 

Tra, = Lay = © — Bat = © — Vata: = x — Va(t — La,) 
= 2 — (& — Nata) = Nata, - 


THEOREM 8. Bis a complete Boolean algebra. If [Aa] and B are elements 
of B, then 


(VaAa) AB = Va(AaA B) 
and ‘io (AaAa) vB = Aa(Aav B). 
In fact for x 2 0, we have by theorem 7 
Dans) = Vataas = Valta,A Xs) = Vata, AXLe = Iva, A Le = Lwag)rs- 


Before giving the next theorem we need a definition. The sum of any set E 
of non-negative elements of L will be defined as follows: For a given ordering 
tz of E, suppose >), 2a to be defined for all 4 < yw. Define Da la = 
Vicu{ 20a Za] + 2,. Continuing this process until the set is exhausted, obtains 
an element >, 2,¢L. Suppose that 2’, is another ordering of E. We next 
show that })z, = Dia. It is easily shown that for any finite set 7 = 
(1, +++, Qn), > Ze >» Zq. Suppose now that for all A < yu, Da tS 

ta — Dr tq Where 7 is any finite set of a’s greater than or equal tou. Hence 
Vi cul Da te] S Dota — Dort,. Asp could also belong to z, this proves the 
induction. It follows that >) 2. < >> 2a. Similarly we could show that 
bt te 2 x ta. Our definition of sum therefore depends only on the set E. 
We will designate this sum by }>» z. 
On the basis of the previous theorem 8 we now have the 
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TurorEM 9. If x 2 O, then x4 defines a lattice homomorphism of B on L. 
In particular if [A a] 1s an arbitrary class of disjoint elements of B, then 


ta, = ie TA, - 


TueoreM 10. If Lhasaunity > 0, thatis L = P(y), then the homomorphism 
which is defined by ya (see theorem 9) ts an tsomorphism. 

Also, for each A €B, A = P(ya) and thus e = yx can be written in the form 
e = yr). Anelement e can be written in the form y, if and only if 


(11) OsSesy and eay—e=0. 


Proor: By theorem 9, A — y, is a homomorphism. Now if A — 0, then 
aeA,a = Oimplies yaa = 0. Since y is a unit this implies a = 0 and hence 
4 =0. Thus A — yg is an isomorphism. 

Since ys € A, we also have P(ys) C A. Thus P(y,4) = ¢(A) is a function 
from B to B, and ¢(A) is always a (proper or improper) part of A. However, 
o(P(ys)) = P(ya), and if A,, Az are disjoint then so are g(A;) and ¢(A2). 
Therefore g(A) = A; that is A = P(y,). Finally (11) is fulfilled fore = y,. 
If (11) holds then e e P(e) = A and y — e€ A’, so that e = y, by theorem 3. 

As a corollary to theorem 10 we have 

TurorEM 11. Jf L has a unit y > 0, then the Boolean algebra B of normal 
subspaces A is isomorphic with the Boolean algebra E of elements e of L for which 
(11) holds. 

Consider a generalized finitely additive Boolean algebra 7 of elements 7, a 
finitely additive positive set function y(r) on 7, and the class L of all finitely 
additive set functions on JT absolutely continuous with respect to y(7). Our 
theory suggests a method of extending 7 modulo null sets to a sigma Boolean 
algebra on which y(7) is completely additive. Now clearly y is a unit for L = 
P(y). Each 7) ¢ T defines a function y(7- 70) which in the notation of theorem 
11 belongs to E. If x > 0 €L, then it is easily shown that rpqi--+,)) = 2(7- 70). 
All sets of y-measure zero correspond to 0 in the Boolean algebra of normal 
subspaces. Finally it is necessary to add certain ideal elements of E which 
do not correspond to elements of 7 modulo null sets. By theorem 9 the set 
functions zr, are completely additive on B. In particular their values on the 
unit of T are completely additive. On sets which do correspond to 79, this 
value is precisely z(79). It is clear that by treating x* and z in this way x(r) ¢ L 
becomes a completely additive set function on the extended algebra E. 

Kakutani has obtained a similar result, namely, the given Boolean algebra 
can be embedded in a sigma field of point sets such that the finitely additive 
function can be extended on this new field to be completely additive. The new 
field is in this case difficult to determine. 

Now our extension modulo null sets of 7 to E is the smallest possible extension 
for which y(r) becomes completely additive. For suppose 7’ were another such 
extension. In this case the elements of E’ defined by (11) for the new lattice 
L' do correspond to elements of T’. As 7’ contains 7 modulo null sets, L’ is 
lattice homomorphic with L = P(y). Therefore E’ > E. 
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We definé the stochastic distance p(71, T2) = y(T1:72) + y(7y° 72). If we 
complete 7 in the usual manner by taking Cauchy sequences, we obtain a sigma 
field which is an extension modulo, null sets of 7 and on which y becomes com- 
pletely additive. As the metric extension of a sigma field modulo null sets 
‘relative to a completely additive set function is precisely the original field modulo 
null sets, it follows that the metric extension is likewise the smailest possible 
extension modulo null sets of 7’. Therefore E is also thé metric extension of T 
modulo null sets. | D2 

Now if L has no unit it is possible to obtain (by means of the choice siiieed 
a set of principal normal subspaces Ag = P(ya) such that VaAq = L, Aaa A; = 
Oif a ¥ 6, and y, > 0. “Putting z= x4, — x7, each element zx of L can then 
‘be represented by the set of components [ra], %2 € Aa. Not all possible combina- 
tions of components [xq] will occur ‘in ‘the representation. 

We now define a new vector lattice. L* to consist of all possible elements 
x = [xa], ta-¢-Aa.’ Defining the lattice operations in the obvious way, the 
new lattice satisfies postulates I-V, possésses a unit: element, namely [ye], and 
the original lattice L can be embedded into it. 

The Boolean algebra B* of normal subspaces of L* is lattice isomorphic with 
the Boolean algebra B of normal subspaces of L. B* ¢ B* is the direct product 
of the normal subspaces B,C A.. We correspond B* ¢ B* to VB; €B and 
A eBto[A A A,] «B*. By theorem 8, (V.B.) A Aq = Bgand V.(A'a A.) = A 
Hence this is a one to one correspondence. The correspondence clearly pre- 
serves order: | . 

We wish finally to consider the question of a generalized base (see section II) 
in vector lattices’Which possess a norm relative to which they are complete. We 
suppose given a set of normal subspaces A, each possessing a generalized base 
Us where the ||‘U || are uniformly bounded. We suppose further that VA = 
L;'Aat Ag = 0 if a ¥ 8B, and that the norm of x (t) = Dea n(ta,). These 
conditions are satisfied by vector lattices of type (L) (1) among which is the 
space V;. It now follows (5) that L itself possesses a generalized base. 
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FREE LATTICES’ 
By Partie M. WHItmMan 


(Received October 18, 1939) 


1. Introduction 


A lattice (sometimes called a structure) is a partially ordered set of elements 
each two of which have a greatest lower and a least upper bound, denoted 
Af\Band A UB, read “A meet B’” and “A join B.” Postulates’ for partial ih : 
ordering are that S, defined between some pairs of elements, satisfy i] 


(1) A SA forall A; 
(2) if A S Band B Ss A, then A = B; 
(3) ifA < BandB SC, thenA SC. 


A lattice is said to be generated by a set of elements X; (the “generators’’) if 
it consists of the X; and their finite combinations by MN and U—e.g., X, U X2, 
((X, U X.] M X;} U X,—sometimes known as “lattice polynomials.” 

However, these polynomials do not all constitute distinct elements; for in- 
stance, X; 1 X; = X; in any lattice by definition of M; furthermore, in a specific 
lattice we may have additional rules of equality; e.g. A N (B UC) = 
(ANB) U(ANC). a: 

The free lattice generated by the X; is’ a lattice generated by them in which 
there are no laws of equality except those derivable from the postulates for a 
lattice. This is the most general lattice generated by the X; , in the sense that 
every other can be obtained from it by a homomorphism—determined by. the 
additional rules of equality. s 

We are concerned here with the internal structure of a free lattice. Given 
two polynomials, how can we find which of <, =, = (if any) hold between 
them in the free lattice (equivalently, in all lattices)? This question is answered 
in §2. In §3 we show that given a polynomial, there is a shortest polynomial 





' Presented to the American Mathematical Society Sept. 5, 1939. 

* Equivalently, we may postulate for a lattice the identities L1: X UX =XNX =X; 
1:XUY=YUX,XNY=YNX; L3:XU(¥UZ) = (XUY)UZ, XN(YNZ) = 
(XN Y)N Z;L4: XU (XN Y) = XN (XUY) = X, note the theorem that X = XNY 
ifand only if Y = X U Y, and define ¥< Y as equivalent to X = XN Y or Y= XUY. 
Cf. Ore, On the Foundation of Abstract Algebra I,-Annals of Math., 36 (1935), p. 409. : 

* Its existence is guaranteed by a theorem of universal algebra; cf. G.._ Birkhoff, On the 4%, 
Structure of Abstract Algebras, Proc. Camb..Phil. Soc., 31 (1935), pp. 440-1. Compare free i 
groups. : ; 
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equal to it in the free lattice (which can be found by a definite procedure), and 
in §4 it is shown that certain elements cover others. 
The author is indebted to Prof. Garrett Birkhoff for many helpful suggestions, 


2. Conditions for A < Bin a free lattice 


Given two elements A and B, we should like to know: is A S B in the free 
lattice? Sometimes this is obviously true; e.g. Xi S X, U X.. Sometimes 
we can show it false, for by the abovementioned homomorphism, A < B jn 
the free lattice implies A < B in every lattice with the same generators. Hence 
if we can exhibit a specific lattice where A ¢ B, then A <¢ B in the free lattice. 
But this is a trial and error method; we should like some definite procedure for 
settling the question. 

THeoreM 1. In the free lattice generated by a set of elements X; , 

(4) X; < X; if and only if 7 = 7; 

(5) recursively, A < B if and only if one or more of the following hold: 


(a) A = A, U Az where A; S Band Az S B, 
(b) A= A, NM As where Aj < Bor Ag s B, 
(c) B = B, U B. where A < B,orA s B., 
(4) B = B, NM B. where A S Bi and A S By. 


Note. In (5a) it is permissible that A, be itself a join; ete. 

We see that this is the sort of condition desired, for, given A and B, we need 
look only at B and part of A, and A and part of B. Since the elements are the 
finite combinations of the generators, this process will eventually end with (4). 

Proor. These conditions are obviously sufficient; to prove them necessary 
we proceed by a series of definitions and lemmas. 

DEFINITION oF C. (6) X; C X; if and only if 7 = j; (7) A C B if and 
only if one or more of (5a-d) are true with S replaced by C. 

(8) Derinition. A > B if and only if B C A. 

Nore. The set of definitions (6)—(8) is self-dual; i.e., if M and U, C and > 
are interchanged the set remains the same. In particular, (7a) and (7d) are 
dual, (7b) and (7c) are dual, and (6) is self-dual. This property will enable us 
to omit many cases in proofs, where we need only make these same changes 
throughout. — 

(9) Derinition. A = B if and only if A C Band A DB. 

(10) Derinition. The length of A, denoted L(A), is the total number of 
X’s appearing in A, counting repetitions; e.g., L(X;) = 1, L(X; U X) = 2, 
L(t[X1 U X] nN X3} U Xi) = 4. 

With S as inclusion relation and = as equality the combinations of the X; 
form the free lattice; we now show that they also form a lattice with C and = 
in these roles; ef. (16). 

(11) Lemma. (a) A CA, (b) A CAUB, (c) A DANB for all A, B. 
Thus A U B is an upper bound to A and B under C. 

Proor by induction on L(A). (11a) is true for L(A) = 1, by (6). If (11a) 
is true for L(A) S< m, then so are (11b, c) by (7c, b). Hence (11a) holds for 


L(é 
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Ir 








— ee ~ ae 


FREE LATTICES 327 


L(A) = m + 1, for say A = A, U Az (dually if A = A, NM Az); then A; C A, 
4, CA by induction; “. A = Ai U A; CA by (7a). 


(12) Lemma. (a) Ai M As CX; tf and only if Ay C X; or Ap CX;. 
(b) X; © Bi U Be if and only if X; C Bi or X; C By. 
(c) Ai U Az C B if and only if A, C Band A, CB. 
(d) A CB, Nf/ B: tf and only if A C Biand A C By. 
(e) Ai N Ae © Bi U By tf and only if Ai N Az C B, or 
Aj a Ag C Bor A, Cc B, U B. or Ae CB,UB,. 

Proor. “If” is obviously true by (7). “Only if” is proved by induction on 
m = L(A) + L(B). It is true for m S 2 vacuously. Assume (12) true for 
m < k — 1; then for m = k, 

(12a) A, A, OC X;. Then A; C X; or Ap C Xj, as desired, by (7b), the 
only part of (7) which applies. 

(12c) Ay U Ae Cc B. Case 1. Ai U Ao Cc X;. Then Ai Cay, Ae CX; 
by (7a). Case 2. A; U A, C B, U B,. Then by (7a, ¢), (¢) A: C B and 
A, © B or (iz) A; U Az Cf B, or (72) A, U Ag Cc B, “eee (it) A, U Ae 4 B,, 
then Aj c By and As ea B, by induction (12c); me Aj ea B, U B, ’ As Cc B, U B, 
by (7c), as desired. Likewise if (cit) holds the lemma does, and if (2), then the 
proof is immediate. Case 3. Ai: U A, CB, NM Bz. Then (i) Ai CB, AN Be 
and Ay C B, NM By or (it) A; U Ag CB, and A, U A; C By, by (7a, d). If (), 
on. p. If (di), then Ay C B,, Az C B,, A; C Bz, Az © Be by induction (12c); 

~ as Cc B, NM B,, Ae Cc B, 1 Bs by (7d). 

(12b, d) dually. - 

(12e) Ai Nf Ae -— B, UB. Then A; Cc B, U B, or A> Cc B, U B. or Ay Nn 
Ag < B, or Ai nN Ae Cc B, by (7b, c). Q.E.D. 

(13) Lemma. If A C Band B CC, thn A CC. 

Proor by induction on m = L(A) + L(B) + L(C). True for m = 3 by (6). 
Induction: 

Case 1. Non-meet CB CC. 

Case la. X,;CX; CC. .t = j by (6). “ky = Xj. jae Ge oe 

Case 1b. X CB,UB, CC. ..X Csome B; by (12b). B; CC by (12c). 
..X CC by induction. 

Case le. X; CB, NB, C X;. Part of dual of case 1b. 

Case 1d. X CB,NB, cc,UC. 


”.B, N Be € some C; 
“.X C C; by induction. 
“.X CC, U C2 by (12b) or (7c). 


Casele A,UA,C BCC. .«.A;CB (all z) by (12c). 
induction. ..A,; U A, CC by (12c). 
Case If. X CBNB CONC. 
Case2. Meet CB CC. ; 
Case 2a. Ai MA, CB Cnon-join. Part of dual of case 1. 


or ened B; CC, U Cz by (12e). 

X C B; by (12d). 

X <C, U Cy by induction. 
“.A; CC (all 2) by 


Part of dual of case le. 


: 
: 
: 
: 
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Case 2b. AiNA,CXCC,UC,. some A; CX by (12a). A; CC, UC, 
by induction. ..A;M As CC, U C; by (128). 
Case 2c. A, NA, CB, UB, | ¢c,¢.. 


“Ai N Az C some B; or some A; C B, U B; by (12e). 
B; CC, U Cy by (12¢). “A; CC, U Ce by induction. 
“A, NM A, CC, U Cy by induction. “A; Az CC, U Cz by (128). 


Case 2d. Ai: NA, CB,NB,CC,UC,. Dual of case 2c. 

(14) Lemma. A U B is the least wpper bound to A and B under C; dually 
for ANB. : 

For it is an upper bound by (11), andif A C Cand B CC, then AUB CC 
by (7a). 

(15) Lemma. = is an equality relation.' 

(16) Lemma. The finite combinations of the X; by M and U form a lattice, 
generated by the X;, with C as inclusion relation and & as equality and A U B, 
A ff) B as least upper, greatest lower bounds to A and B. 

Proor. (1), (2), (3) are satisfied [(11), (9), (13)], and A U B is least upper 
bound by (14). 

PROOF OF THEOREM 1. Any other lattice generated by the X; is a homo- 
morphic image of the free lattice, hence S in a free lattice is sufficient for C. 
But by definition of U as least upper bound and induction, C is sufficient for < 
and hence = for =. Therefore C and S are equivalent in the free lattice. 
Theorem 1 then follows from (6) and (7). i 

Now denote A; U A, UU... UA, by Do? A;, A, N--- N Ag by TJ? Ai, or 
simply >> A; , [] A; if there is no confusion. 

(17) Corotuary. (a) [] A: CX; éf and only if some A; C X;. 

(b) X; C > B; if and only if X; C some B;. 
(c) >. A; CB if and only tf every A; CB. 
(d) A CI] B; if and only if A C every B;. 
(e) [] A: C >> B; #f and only if [] A: C some B; or some 
A, & 2 wy. 
(f) A; CU A;, Bs DI] B; alli. 
(g) >. A; is the least upper bound to A,, --- , An under C. 

Proor. By repetition of (12), (11). 

To answer “Is A = B?,” apply (2) and theorem 1. Conditions (12) or (17) 
are more convenient in practise than (5). (17b) is like the condition that a 
prime number divide a product. 


3. Canonical forms 


Having thus found one collection of elements equal to each other, and like- 
wise other collections, we should like to,choose as canonical forms one element 
from each collection. 





‘Cf. Schréder, Algebra der Logik, vol. 1, p. 184, or MacNeille, Partially Ordered Sets, 
Trans. Am. Math. Soc., 42 (1937), pp. 416-60, or it may be readily verified directly. 
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TuroreM 2. Of all the elements equal in a free lattice, there ts one of shortest 
ngth, unique except for commutativity and associativity. 

Proor. We show by induction on k = L(A) + L(B) thatifA > B, A F B, 
then C3:A & BC, and L(C) < L(A) or else L(C) < L(B). Then if the 
theorem were false, say A and B were alleged to be both of the shortest length, 
we could find a still shorter element unless A = B! It is true vacuously for 
= 2. Induction: 

Case 1. be Aia& > B;. Here, but not in the previous section, we assume 
4, and B; not themselves joins. > A: C DB; by (9). oA Iba c 
5: B, (all i) by (17). +. by (17e), for any i, (i) some a, C )) B; or (ii) 
A; C some B,. Note that (i) holds if A; = X,, by (176). Similarly, for 
any 7, (ii) some bc > A; or (ww) B; C some A,. 

If (i) holds for some 7, then a‘, Cc > B; Cc > A; by (9). A; C > An ; 
j#1. ‘ly U Asc 2 A;. Also A; C a\,, so a, A; Ca, U Dini Aj; 


le 


a, U Divi A; & DL A;and the theorem holds in this case; likewise if (7) 


holds for some 2. : 

Otherwise, (i7) and (iv) hold for all 7. .. for all 7, A; C some By, By C 
some A, i) [f, g need not be single-valued]. If 7 A: gif(i)] ¥ 2, then Ay C 
Bui C A; G # 1), Don«i An & An, and the theorem holds; similarly if 
flg()] # 2. If not, then f[g(t)] = 7, gif@] = 7 for all 7. «A; & Br, 
B; = Aga all 7. But DA: # > B; by hypothesis, ..p 3: Ap 4 By or 
B, # Ag) , Say the former. .. by induction, D3: A, > By = D, L(D) < 
L(A,) or L(D) < L(Byp), say the former. Then D U > re A,.= > A, and 
the theorem holds. 

Case2. [J] A:xJ] Bi. Dual. 


Case3. DA: » J] B;. ~. 2) Ai CJ] B; ; hence every A; C [J B; and 
Lace every B;. Also Il B,C z A;, 


..[] B; Csome A, or some B, C 9) Aj. 
A, C |] B; by above. > A; CB, by above. 


A,s2[[Bj2Dd Av. «BS DALTLB;. ovo. 

We take this unique shortest form as the canonical form. 

(18) Corotuary 1. A = >) A: = Ds (JJ; a}) U Dee X; (E any subset of 
1, --» , n) is canonical if and only if (a) noa’, C >) A, and (6) no A; C Donei An 
and (c) every A; is canonical. Dually for [J A;. 

Corollary 1 follows from the proof of theorem 2. From case 3 we might also 
require: no Ay & )) An, but this is included in (6). 

If }) A; is not cenonical, we can find the canonical element equal to it, for 
if (a) is false, then ai, U Det Ay =~ > A;j, if (0) is false, then Deets A,=> 

A,, if (c) is false then replace it by its canonical form; in any case we get a 
shorter, equal element, to which we can again apply the process. 
We can now build up a diagram of the free lattice step by step, starting with 
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the shortest elements. We could not hope to get it all at once since there are 
infinitely many distinct elements if there are more than two generators.’ 

(19) Corotuary 2. If > AS > B;, and > A; is canonical, then (a)B;C 
> Aj, all i, and (6) given i,j 3: Ai C B;. 

Proor. (a) follows from (12c), and if A; = X, then (6) follows from (12c, b), 
Otherwise A; = [];a} C >) B; by (12c). -. by (12e), either A; C some B, 
as desired, or some a; C > B:& > A;cand then by (18a) > A;is not canonical 
contrary to hypothesis. 


4. Covering theorems 


Derinition. A covers BifA > BandnoCd:A>C>B. 

We present some scattered results on the existence of such pairs of elements. 
Denote the free lattice of n generators by F,. In this section (but not pre- 
viously) we assume 7 finite. 

THEeorEM 3. In the free lattice of n generators, if A > (Xen XN (Sm X,), 
then A = >.” Xi or Doig Xi Or Doigeg Xi 

We first prove a 

Lemma. InF,,A = >, A: > Dosey Xi if and only if given i, 7 3: A; = X;. 

PROOF OF LEMMA by induction on L(A). Obvious for L(A) S n. Suppose 
L(A) = k > n, obviously redundant terms having been omitted; e.g., X, U X; 
contracted to X,. Also, by (17b), we may suppose no A; is itself a join. Then 
some A; has at least two factors, say A1 = [[ a;. Then 

~X:< DA Sa,UD A; (all s). 
isp il 
By induction, given i, j 3: j* term = X;. If for some s this term is an Ai, 
then the lemma holds; otherwise a, 2 X; for all s, and Ai 2 X;. 

Corotiary 1. InF,, >, Ai = >? X; if and only if given i,j 3: A; 2 Xi. 
D Ai = Diep Xi tf and only if given i (i ¥ p),j 3: A; = Xi, but no A; = Xp. 

Corotuary 2. InF,,, >o? X; covers Dviep Xi. 

Corotiary 3. InF,,, every A ¥ Do? Xi is S Doig Xi for some p. Proved 
by induction. 

The rest of theorem 3 is now proved in much the same way. 

Corouuary 4. If Fn, Drigp Xi covers (Dvigep Xi) M (Diigeg Xi), any 4. 

Corotiary 5. InF,, if A > Xp, then A = XyU ([Jinp Xi). 

THEOREM 4. LI Xj; covers > X; which in turn covers Que Xi) n 
(doing Xi), and X, U (T]izp X:) covers X,, in any lattice generated by the X; in 
which these elements are distinct. 

For any other lattice is a homomorphic image of the free one. 

The duals of these results are of course likewise true. 


HarvarpD UNIVERSITY 





5 Cf. G. Birkhoff, op. cit., p. 451. The distinctness of his elements in the free lattice 
can also be shown by (12). Note that, taking every sixth element, we get an infinite 
“chain”’ of distinct elements. 
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CORRECTIONS TO OUR PAPER ON THE EXISTENCE OF MINIMAL 
SURFACES OF GENERAL CRITICAL TYPES 


By Marston Morse anp C. Tompkins 


The following corrections should be made in our paper which appeared in 
volume 40 (1939) of these Annals. 

(1). In equation (5.14) interchange ¢(8) and y(a). 

(2). The argument on page 454 following equation (5.16) should be replaced 
by the following. The case a = 6 in (5.16) may be discarded. When a ¥ 8 
(5.16) occurs only when ¥(a) is constant on some interval. In this case the 
harmonic surface defined by ¥(a) is not a minimal surface (see Rad4, loc. cit., 
p. 75). The transformations used in §6 suffice in this case to prove that A(¢) 
is upper reducible at y. 

(3). Replace the first two paragraphs on page 460 by the following. A 
directly conformal 1-1 transformation of the disc r < 1 into itself induces a 
transformation 7'(6) of the circle r = 1 into itself. A curve [p] of Z will be said 
to be equivalent to g(a) if for a suitably chosen transformation 7'(@) of the 
above type, the transformation ¢(7'(@)) “defines’’ [p]. 

(4). In the fourth line following Lemma 6.1 replace “the point f(p) of 2” 
by a point f(p) of 2 equivalent to [p] and varying continuously with [p]. 


Tue INsTITUTE FOR ADVANCED Stoupy, 
PRINCETON UNIVERSITY. 
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